(57) Abstract: The sequences of antisense nucleic acids which inhibit the proliferation of prokaryotes are disclosed. Cell-based
assays which employ the antisense nucleic acids to identify and develop antibiotics are also disclosed. The antisense nucleic acids
can also be used to identify proteins required for proliferation, express these proteins or portions thereof, obtain antibodies capable
of specifically binding to the expressed proteins, and use those expressed proteins as a screen to isolate candidate molecules
for rational drug discovery programs. The nucleic acids can also be used to screen for homologous nucleic acids that are required
for proliferation in cells other than Staphylococcus aureus, Salmonella typhimurium, Klebsiella pneumoniae, and Pseudomonas
aeruginosa. The nucleic acids of the present invention can also be used in various assay systems to screen for proliferation-required
genes in other organisms.
IDENTIFICATION OF ESSENTIAL GENES IN MICROORGANISMS

Sequence Listing

The present application is being filed along with quadruplicate copies of a CD-ROM 
marked "Copy 1 - SEQUENCE LISTING PART," "Copy 2 - SEQUENCE LISTING PART,"
"Copy 3 - SEQUENCE LISTING PART," and "CRF" containing a Sequence Listing in electronic
format. The quadruplicate copies of the CD-ROM each contain a file entitled 
034VPC_final.ST25.txt, created on March 15, 2002, which is 181,323,311 bytes in size. 

Background of the Invention
Since the discovery of penicillin, the use of antibiotics to treat the ravages of bacterial 
infections has saved millions of lives. With the advent of these "miracle drugs," for a time it was
popularly believed that humanity might, once and for all, be saved from the scourge of bacterial 
infections. In fact, during the 1980s and early 1990s, many large pharmaceutical companies cut 
back or eliminated antibiotics research and development. They believed that infectious disease 
caused by bacteria finally had been conquered and that markets for new drugs were limited. 
Unfortunately, this belief was overly optimistic.

The tide is beginning to turn in favor of the bacteria as reports of drug resistant bacteria 
become more frequent. The United States Centers for Disease Control announced that one of the most 
powerful known antibiotics, vancomycin, was unable to treat an infection of the common 
Staphylococcus aureus (staph). This organism is commonly found in our environment and is 
responsible for many nosocomial infections. The import of this announcement becomes clear when
one considers that vancomycin was used for years to treat infections caused by Staphylococcus 
species as well as other stubborn strains of bacteria. In short, bacteria are becoming resistant to our 
most powerful antibiotics. If this trend continues, it is conceivable that we will return to a time 
when what are presently considered minor bacterial infections are fatal diseases.

Over-prescription and improper prescription habits by some physicians have caused an
indiscriminate increase in the availability of antibiotics to the public. Patients are also partly
responsible, since they often use the drugs improperly, thereby generating yet another population of
bacteria that is resistant, in whole or in part, to traditional antibiotics.

The bacterial pathogens that have haunted humanity remain, in spite of the development of 
modern scientific practices to deal with the diseases that they cause. Drug resistant bacteria are now an
increasing threat to the health of humanity. A new generation of antibiotics is needed to once again 
deal with the pending health threat that bacteria present. 

Discovery of New Antibiotics 
As more and more bacterial strains become resistant to the panel of available antibiotics, 
new antibiotics are required to treat infections. In the past, practitioners of pharmacology would
have to rely upon traditional methods of drug discovery to generate novel, safe and efficacious 
compounds for the treatment of disease. Traditional drug discovery methods involve blindly testing 
potential drug candidate-molecules, often selected at random, in the hope that one might prove to be 
an effective treatment for some disease. The process is painstaking and laborious, with no
guarantee of success. Today, the average cost to discover and develop a new drug exceeds US $500
million, and the average time from laboratory to patient is 15 years. Improving this process, even
incrementally, would represent a huge advance in the generation of novel antimicrobial agents.

Newly emerging practices in drug discovery utilize a number of biochemical techniques to
provide for directed approaches to creating new drugs, rather than discovering them at random. For 
example, gene sequences and proteins encoded thereby that are required for the proliferation of a 
cell or microorganism make excellent targets since exposure of bacteria to compounds active 
against these targets would result in the inactivation of the cell or microorganism. Once a target is 
identified, biochemical analysis of that target can be used to discover or to design molecules that
interact with and alter the functions of the target. Use of physical and computational techniques to 
analyze structural and biochemical properties of targets in order to derive compounds that interact 
with such targets is called rational drug design and offers great potential. Thus, emerging drug 
discovery practices use molecular modeling techniques, combinatorial chemistry approaches, and 
other means to produce and screen and/or design large numbers of candidate compounds.

Nevertheless, while this approach to drug discovery is clearly the way of the future, 
problems remain. For example, the initial step of identifying molecular targets for investigation can 
be an extremely time consuming task. It may also be difficult to design molecules that interact with 
the target by using computer modeling techniques. Furthermore, in cases where the function of the 
target is not known or is poorly understood, it may be difficult to design assays to detect molecules
that interact with and alter the functions of the target. To improve the rate of novel drug discovery 
and development, methods of identifying important molecular targets in pathogenic cells or 
microorganisms and methods for identifying molecules that interact with and alter the functions of 
such molecular targets are urgently required.

Escherichia coli represents an excellent model system to understand bacterial biochemistry
and physiology. The estimated 4288 genes scattered along the 4.6 x 10^6 base pairs of the
Escherichia coli (E. coli) chromosome offer tremendous promise for the understanding of bacterial 
biochemical processes. In turn, this knowledge will assist in the development of new tools for the 
diagnosis and treatment of bacteria-caused human disease. The entire E. coli genome has been 
sequenced, and this body of information holds a tremendous potential for application to the
discovery and development of new antibiotic compounds. Yet, in spite of this accomplishment, the
general functions or roles of many of these genes are still unknown. For example, the total number
of proliferation-required genes contained within the E. coli genome is unknown, but has been
variously estimated at around 200 to 700 (Armstrong, K.A. and Fan, D.P. Essential Genes in the
metB-malB Region of Escherichia coli K12, 1975, J. Bacteriol. 126: 48-55).

Staphylococcus aureus is a Gram positive microorganism which is the causative agent of 
many infectious diseases. Local infection by Staphylococcus aureus can cause abscesses on skin 
and cellulitis in subcutaneous tissues and can lead to toxin-related diseases such as toxic shock and
scalded skin syndromes. Staphylococcus aureus can cause serious systemic infections such as
osteomyelitis, endocarditis, pneumonia, and septicemia. Staphylococcus aureus is also a common
cause of food poisoning, often arising from contact between prepared food and infected food
industry workers. Antibiotic resistant strains of Staphylococcus aureus have recently been
identified, including those that are now resistant to all available antibiotics, thereby severely
limiting the options of care available to physicians.

Pseudomonas aeruginosa is an important Gram negative opportunistic pathogen. It is the
most common Gram negative found in nosocomial infections. P. aeruginosa is responsible for 16%
of nosocomial pneumonia cases, 12% of hospital-acquired urinary tract infections, 8% of surgical
wound infections, and 10% of bloodstream infections. Immunocompromised patients, such as
neutropenic cancer and bone marrow transplant patients, are particularly susceptible to opportunistic
infections. In this group of patients, P. aeruginosa is responsible for pneumonia and septicemia
with attributable deaths reaching 30%. P. aeruginosa is also one of the most common and lethal
pathogens responsible for ventilator-associated pneumonia in intubated patients, with directly
attributable death rates reaching 38%. Although P. aeruginosa outbreaks in burn patients are rare, it
is associated with 60% death rates. In the AIDS population, P. aeruginosa is associated with 50%
of deaths. Cystic fibrosis patients are characteristically susceptible to chronic infection by P. 
aeruginosa, which is responsible for high rates of illness and death. Current antibiotics work poorly
for CF infections (Van Delden & Iglewski. 1998. Emerging Infectious Diseases 4:551-560;
references therein).

The gram negative enteric bacterial genus, Salmonella, encompasses at least 2 species. One 
of these, S. enterica, is divided into multiple subspecies and thousands of serotypes or serovars 
(Brenner, et al. 2000 J. Clin. Microbiol. 38:2465-2467). The S. enterica human pathogens include
serovars Typhi, Paratyphi, Typhimurium, Cholerasuis, and many others deemed so closely related
that they are variants of a widespread species. Worldwide, disease in humans caused by Salmonella
is a very serious problem. In many developing countries, S. enterica ser. Typhi still causes often- 
fatal typhoid fever. This problem has been reduced or eliminated in wealthy industrial states. 
However, enteritis induced by Salmonella is widespread and is the second most common disease 
caused by contaminated food in the United States (Edwards, BH 1999 "Salmonella and Shigella 
species" Clin. Lab Med. 19(3):469-487). Though usually self-limiting in healthy individuals, others
such as children, seniors, and those with compromising illnesses can be at much greater risk of 
serious illness and death. 

Some S. enterica serovars (e.g. Typhimurium) cause a localized infection in the 
gastrointestinal tract. Other serovars (i.e. Typhi and Paratyphi) cause a much more serious systemic 
infection. In animal models, these roles can be reversed, which has allowed the use of the relatively
safe S. enterica ser. Typhimurium as a surrogate in mice for the typhoid fever agent, S. enterica ser.
Typhi. In mice, S. enterica ser. Typhimurium causes a systemic infection similar in outcome to
typhoid fever. Years of study of the Salmonella have led to the identification of many determinants 
of virulence in animals and humans. Salmonella is interesting in its ability to localize to and invade
the intestinal epithelium, induce morphologic changes in target cells via injection of certain
cell-remodeling proteins, and to reside intracellularly in membrane-bound vesicles (Wallis, TS and
Galyov, EE 2000 "Molecular basis of Salmonella-induced enteritis." Molec. Microb. 36:997-1005;
Falkow, S "The evolution of pathogenicity in Escherichia, Shigella, and Salmonella," Chap. 149 in
Neidhardt, et al. eds pp 2723-2729; Gulig, PA "Pathogenesis of Systemic Disease," Chap. 152 in
Neidhardt, et al. pp 2774-2787). The immediate infection often results in a severe watery diarrhea
but Salmonella also can establish and maintain a subclinical carrier state in some individuals. 
Spread is via food contaminated with sewage. 

The gene products implicated in Salmonella pathogenesis include type three secretion
systems (TTSS), proteins affecting cytoplasmic structure of the target cells, many proteins carrying
out functions necessary for survival and proliferation of Salmonella in the host, as well as
"traditional" factors such as endotoxin and secreted exotoxins. Additionally, there must be factors
mediating species-specific illnesses. Despite this, most of the genomes of S. enterica ser. Typhi (see
http://www.sanger.ac.uk/Projects/S_typhi/ for the genome database) and S. enterica ser.
Typhimurium (see http://genome.wustl.edu/gsc/bacterial/salmonella.shtml for the genome database)
are highly conserved and are mutually useful for gene identification in multiple serovars. The
Salmonella are a complex group of enteric bacteria causing disease similar to but distinct from
other gram negative enterics such as E. coli and have been a focus of biomedical research for the
last century.

Enterococcus faecalis, a Gram positive bacterium, is by far the most common member of
the enterococci to cause infections in humans. Enterococcus faecium generally accounts for less
than 20% of clinical isolates. Enterococcal infections are mostly hospital-acquired though they are
also associated with some community-acquired infections. Of nosocomial infections enterococci
account for 12% of bacteremia, 15% of surgical wound infections, 14% of urinary tract infections,
and 5 to 15% of endocarditis cases (Huycke, M. M., D. F. Sahm, and M. S. Gilmore. 1998.
Emerging Infectious Diseases 4:239-249). Additionally enterococci are frequently associated with
intraabdominal and pelvic infections. Enterococcal infections are often hard to treat because the organisms are
resistant to a vast array of antimicrobial drugs, including aminoglycosides, penicillin, ampicillin
and vancomycin. The development of multiple-drug resistant (MDR) enterococci has made these
bacteria a major concern for treating nosocomial infections.

Current drug discovery methods involve screening large numbers of prospective therapeutic
compounds to identify those that are effective therapeutic agents or that can be optimized to provide
effective therapeutic agents. For example, the compounds to be evaluated for therapeutic
activity may be members of a library of compounds generated by combinatorial chemistry or
members of a library of natural products.

Unfortunately, current methods are laborious and time consuming and may yield 
compounds which have already been identified or which act on gene products which are already 
targeted by an existing therapeutic agent. In addition, a large number of compounds have been
identified which have antimicrobial activity but which cannot be administered to individuals 
suffering from infection due to the fact that their targets are unknown. 

The above reasons underscore the urgency of developing new antibiotics that are effective 
against Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae,
Pseudomonas aeruginosa, and Salmonella typhimurium. Accordingly, there is an urgent need for 
more novel methods to identify and characterize bacterial genomic sequences that encode gene 
products involved in proliferation, and are thereby potential new targets for antibiotic development. 
Likewise, there is a need for rapid screening techniques which yield novel compounds or 
compounds which act on novel targets as well as a need for methods which permit the identification
of the target on which a compound with antimicrobial activity acts. 

Prior to the present invention, the discovery of Escherichia coli, Staphylococcus aureus, 
Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, and Salmonella 
typhimurium genes required for proliferation of the microorganism was a painstaking and slow 
process. Rapid screening techniques for identifying novel targets on which novel compounds act
were undeveloped. While the detection and identification of new cellular drug targets within an
Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae,
Pseudomonas aeruginosa, and Salmonella typhimurium cell is key for novel antibiotic development
and effective treatment, the methods of drug target discovery available prior to this
invention have required painstaking processes requiring years of effort.

Summary of the Invention 
Some aspects of the present invention are described in the numbered paragraphs below. 

1. A purified or isolated nucleic acid sequence comprising a nucleotide sequence 
consisting essentially of one of SEQ ID NOs: 1-6213, wherein expression of said nucleic acid
inhibits proliferation of a cell.

2. The nucleic acid sequence of Paragraph 1, wherein said nucleotide sequence is 
complementary to at least a portion of a coding sequence of a gene whose expression is required for 
proliferation of a cell. 

3. The nucleic acid of Paragraph 1, wherein said nucleic acid sequence is 
complementary to at least a portion of a nucleotide sequence of an RNA required for proliferation
of a cell.

4. The nucleic acid of Paragraph 3, wherein said RNA is an RNA comprising a 
sequence of nucleotides encoding more than one gene product. 
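
As an illustration of the complementarity recited in Paragraphs 2 through 4, the following minimal sketch computes the reverse complement of a DNA sequence and checks whether a candidate antisense sequence is exactly complementary to some portion of a target. The function names and the example sequence are hypothetical and are not taken from the Sequence Listing; exact matching is a simplifying assumption, since the paragraphs above do not require perfect complementarity over any particular length.

```python
# Illustrative sketch only: the names and the sequence are hypothetical,
# not drawn from SEQ ID NOs: 1-6213.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def is_antisense_to_portion(candidate: str, target: str) -> bool:
    """True if `candidate` is exactly complementary to a stretch of `target`."""
    return reverse_complement(candidate).upper() in target.upper()


# Made-up coding-strand sequence standing in for a proliferation-required gene.
target_gene = "ATGGCTAGCAAAGGAGAAGAACTT"

# An antisense sequence built against positions 3..14 of the target.
antisense = reverse_complement(target_gene[3:15])
print(antisense)
print(is_antisense_to_portion(antisense, target_gene))
```

An RNA target can be handled the same way by first writing it in DNA letters (U replaced by T); that conversion is omitted here for brevity.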

5. A purified or isolated nucleic acid comprising a fragment of one of SEQ ID NOs.: 
1-6213, said fragment selected from the group consisting of fragments comprising at least 10, at
least 20, at least 25, at least 30, at least 50 and more than 50 consecutive nucleotides of one of SEQ
ID NOs: 1-6213.

6. The fragment of Paragraph 5, wherein said fragment is included in a nucleic acid
obtained from an organism selected from the group consisting of Acinetobacter baumannii,
Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella
pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia
mallei, Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis
glabrata), Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei,
Candida kefyr (also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia
pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum,
Clostridium difficile, Clostridium perfringens, Coccidioides immitis, Corynebacterium diptheriae,
Cryptococcus neoformans, Enterobacter cloacae, Enterococcus faecalis, Enterococcus faecium,
Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Histoplasma capsulatum, Klebsiella
pneumoniae, Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis,
Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis,
Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis,
Nocardia asteroides, Pasteurella haemolytica, Pasteurella multocida, Pneumocystis carinii,
Proteus mirabilis, Proteus vulgaris, Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas
syringae, Salmonella bongori, Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi,
Salmonella typhi, Salmonella typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri,
Shigella sonnei, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus,
Streptococcus pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum,
Ureaplasma urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia
enterocolitica, Yersinia pestis and any species falling within the genera of any of the above species.

7. The fragment of Paragraph 5, wherein said fragment is included in a nucleic acid
obtained from an organism other than Escherichia coli.

8. A vector comprising a promoter operably linked to the nucleic acid of any one of
Paragraphs 1-7.

9. The vector of Paragraph 8, wherein said promoter is active in a microorganism 
selected from the group consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus 
fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi,
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni,
Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida
parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida
pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis,
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium perfringens,
Coccidioides immitis, Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter
cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae,
Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila,
Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

10. A host cell containing the vector of Paragraph 8 or Paragraph 9. 

11. A purified or isolated antisense nucleic acid comprising a nucleotide sequence 
complementary to at least a portion of an intragenic sequence, intergenic sequence, sequences 
spanning at least a portion of two or more genes, 5' noncoding region, or 3' noncoding region
within an operon comprising a proliferation-required gene whose activity or expression is inhibited
by an antisense nucleic acid comprising the nucleotide sequence of one of SEQ ID NOs.: 1-6213. 

12. The purified or isolated antisense nucleic acid of Paragraph 11, wherein said 
antisense nucleic acid is complementary to a nucleic acid from an organism selected from the group 
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

13. The purified or isolated antisense nucleic acid of Paragraph 11, wherein said 
nucleotide sequence is complementary to a nucleotide sequence of a nucleic acid from an organism
other than E. coli.

14. The purified or isolated antisense nucleic acid of Paragraph 11, wherein said 
proliferation-required gene comprises a nucleotide sequence selected from the group consisting of 
SEQ ID NOS.: 6214-42397. 

15. A purified or isolated nucleic acid comprising a nucleotide sequence having at least 
70% identity to a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213,
fragments comprising at least 25 consecutive nucleotides of SEQ ID NOs.: 1-6213, the nucleotide
sequences complementary to SEQ ID NOs.: 1-6213 and the sequences complementary to fragments
comprising at least 25 consecutive nucleotides of SEQ ID NOs.: 1-6213 as determined using
BLASTN version 2.0 with the default parameters.
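
To make the 70% identity threshold of Paragraph 15 concrete, the sketch below computes percent identity between two sequences that are already aligned and of equal length. This is a deliberately simplified stand-in and is not BLASTN: BLASTN finds local alignments, may introduce gaps, and reports identity over the aligned region. The function names, the default threshold argument, and the example sequences are assumptions made for illustration only.

```python
# Simplified illustration: ungapped, position-by-position identity.
# This is NOT the BLASTN procedure named in Paragraph 15.


def percent_identity(a: str, b: str) -> float:
    """Percent of positions at which two equal-length sequences match."""
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def meets_identity_threshold(a: str, b: str, threshold: float = 70.0) -> bool:
    """True if the ungapped identity is at least `threshold` percent."""
    return percent_identity(a, b) >= threshold


# Hypothetical 10-nucleotide sequences differing at two positions.
query = "ACGTACGTAC"
subject = "ACGTTCGTAA"
print(percent_identity(query, subject))
print(meets_identity_threshold(query, subject))
```

For sequences of unequal length or with insertions and deletions, an alignment step would be needed before any identity figure is meaningful, which is the role BLASTN plays in the paragraph above.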

16. The purified or isolated nucleic acid of Paragraph 15, wherein said nucleic acid is
obtained from an organism selected from the group consisting of Acinetobacter baumannii,
Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella
pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia
mallei, Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis
glabrata), Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei,
Candida kefyr (also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia
pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum,
Clostridium difficile, Clostridium perfringens, Coccidioides immitis, Corynebacterium diptheriae,
Cryptococcus neoformans, Enterobacter cloacae, Enterococcus faecalis, Enterococcus faecium,
Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Histoplasma capsulatum, Klebsiella
pneumoniae, Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis,
Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis,
Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis,
Nocardia asteroides, Pasteurella haemolytica, Pasteurella multocida, Pneumocystis carinii,
Proteus mirabilis, Proteus vulgaris, Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas
syringae, Salmonella bongori, Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi,
Salmonella typhi, Salmonella typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri,
Shigella sonnei, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus,
Streptococcus pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum,
Ureaplasma urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia
enterocolitica, Yersinia pestis and any species falling within the genera of any of the above species.

17. The nucleic acid of Paragraph 15, wherein said nucleic acid is obtained from an 
organism other than E. coli.

18. A vector comprising a promoter operably linked to a nucleic acid encoding a
polypeptide whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence of any one of SEQ ID NOs.: 1-6213. 

19. The vector of Paragraph 18, wherein said nucleic acid encoding said polypeptide is
obtained from an organism selected from the group consisting of Acinetobacter baumannii,
Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella
pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia
mallei, Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis
glabrata), Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei,
Candida kefyr (also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia
pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum,
Clostridium difficile, Clostridium perfringens, Coccidioides immitis, Corynebacterium diptheriae,
Cryptococcus neoformans, Enterobacter cloacae, Enterococcus faecalis, Enterococcus faecium,
Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Histoplasma capsulatum, Klebsiella
pneumoniae, Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis,
Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis,
Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis,
Nocardia asteroides, Pasteurella haemolytica, Pasteurella multocida, Pneumocystis carinii,
Proteus mirabilis, Proteus vulgaris, Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas
syringae, Salmonella bongori, Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi,
Salmonella typhi, Salmonella typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri,
Shigella sonnei, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus,
Streptococcus pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum,
Ureaplasma urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia
enterocolitica, Yersinia pestis and any species falling within the genera of any of the above species.

20. The vector of Paragraph 18, wherein said nucleotide sequence encoding said 
polypeptide is obtained from an organism other than E. coli. 

21. A host cell containing the vector of Paragraph 18.

22. The vector of Paragraph 18, wherein said polypeptide comprises a polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 42398-78581.

23. The vector of Paragraph 18, wherein said promoter is operably linked to a nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214- 
42397. 

24. A purified or isolated polypeptide comprising a polypeptide whose expression is
inhibited by an antisense nucleic acid comprising a nucleotide sequence of any one of SEQ ID
NOs.: 1-6213, or a fragment selected from the group consisting of fragments comprising at least 5,
at least 10, at least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 consecutive
amino acids of one of the said polypeptides. 

25. The polypeptide of Paragraph 24, wherein said polypeptide comprises an amino 
acid sequence of any one of SEQ ID NOs.: 42398-78581 or a fragment comprising at least 5, at 
least 10, at least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 consecutive 
amino acids of a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NOs.: 42398-78581. 

26. The polypeptide of Paragraph 24, wherein said polypeptide is obtained from an 
organism selected from the group consisting of 
Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, 
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida 
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida 
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida 
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis, 
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus 
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, 
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis and any species falling within the genera of any of the above species. 

27. The polypeptide of Paragraph 24, wherein said polypeptide is obtained from an 
organism other than E. coli. 

28. A purified or isolated polypeptide comprising a polypeptide having at least 25% 
amino acid identity to a polypeptide whose expression is inhibited by a nucleic acid comprising a 
nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, or at least 25% 
amino acid identity to a fragment comprising at least 10, at least 20, at least 30, at least 40, at least 
50, at least 60 or more than 60 consecutive amino acids of a polypeptide whose expression is 
inhibited by a nucleic acid comprising a nucleotide sequence selected from the group consisting of 
SEQ ID NOs.: 1-6213 as determined using FASTA version 3.0t78 with the default parameters. 

29. The polypeptide of Paragraph 28, wherein said polypeptide has at least 25% 
identity to a polypeptide comprising one of SEQ ID NOs: 42398-78581 or at least 25% identity to a 
fragment comprising at least 5, at least 10, at least 20, at least 30, at least 40, at least 50, at least 60 
or more than 60 consecutive amino acids of a polypeptide comprising one of SEQ ID NOs.: 
42398-78581 as determined using FASTA version 3.0t78 with the default parameters. 

30. The polypeptide of Paragraph 28, wherein said polypeptide is obtained from an 
organism selected from the group consisting of 
Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, 
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida 
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida 
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida 
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis, 
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus 
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, 
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis and any species falling within the genera of any of the above species. 

31. The polypeptide of Paragraph 28, wherein said polypeptide is obtained from an 
organism other than E. coli. 

32. An antibody capable of specifically binding the polypeptide of one of Paragraphs 
28-31. 

33. A method of producing a polypeptide, comprising introducing a vector comprising 
a promoter operably linked to a nucleic acid comprising a nucleotide sequence encoding a 
polypeptide whose expression is inhibited by an antisense nucleic acid comprising one of SEQ ID 
NOs.: 1-6213 into a cell. 

34. The method of Paragraph 33, further comprising the step of isolating said 
polypeptide. 

35. The method of Paragraph 33, wherein said polypeptide comprises an amino acid 
sequence selected from the group consisting of SEQ ID NOs.: 42398-78581. 

36. The method of Paragraph 33, wherein said nucleic acid encoding said polypeptide 
is obtained from an organism selected from the group consisting of 
Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, 
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida 
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida 
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida 
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis, 
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus 
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, 
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis and any species falling within the genera of any of the above species. 

37. The method of Paragraph 33, wherein said nucleic acid encoding said polypeptide 
is obtained from an organism other than E. coli. 

38. The method of Paragraph 33, wherein said promoter is operably linked to a nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214- 
42397. 

39. A method of inhibiting proliferation of a cell in an individual comprising inhibiting 
the activity or reducing the amount of a gene product whose expression is inhibited by an antisense 
nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 
1-6213 or inhibiting the activity or reducing the amount of a nucleic acid encoding said gene 
product. 

40. The method of Paragraph 39, wherein said method comprises inhibiting said 
activity or reducing said amount of a gene product in an organism selected from the group 
consisting of 
Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, 
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida 
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida 
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida 
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis, 
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus 
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, 
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis and any species falling within the genera of any of the above species. 

41. The method of Paragraph 39, wherein said method comprises inhibiting said 
activity or reducing said amount of a gene product in an organism other than E. coli. 

42. The method of Paragraph 39, wherein said gene product is present in an organism 
other than E. coli. 

43. The method of Paragraph 39, wherein said gene product comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 42398-78581. 

44. A method for identifying a compound which influences the activity of a gene 
product required for proliferation, said gene product comprising a gene product whose expression is 
inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213, said method comprising: 

contacting said gene product with a candidate compound; and 

determining whether said compound influences the activity of said gene product. 

45. The method of Paragraph 44, wherein said gene product is from an organism 
selected from the group consisting of 
Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, 
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida 
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida 
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida 
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis, 
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus 
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, 
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia enterocolitica, 
Yersinia pestis and any species falling within the genera of any of the above species. 

46. The method of Paragraph 44, wherein said gene product is from an organism other 
than E. coli. 

47. The method of Paragraph 44, wherein said gene product is a polypeptide and said 
activity is an enzymatic activity. 

48. The method of Paragraph 44, wherein said gene product is a polypeptide and said 
activity is a carbon compound catabolism activity. 

49. The method of Paragraph 44, wherein said gene product is a polypeptide and said 
activity is a biosynthetic activity. 

50. The method of Paragraph 44, wherein said gene product is a polypeptide and said 
activity is a transporter activity. 

51. The method of Paragraph 44, wherein said gene product is a polypeptide and said 
activity is a transcriptional activity. 

52. The method of Paragraph 44, wherein said gene product is a polypeptide and said 
activity is a DNA replication activity. 

53. The method of Paragraph 44, wherein said gene product is a polypeptide and said 
activity is a cell division activity. 

54. The method of Paragraph 44, wherein said gene product is an RNA. 

55. The method of Paragraph 44, wherein said gene product is a polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID NOs.: 
42398-78581. 

56. A compound identified using the method of Paragraph 44. 

57. A method for identifying a compound or nucleic acid having the ability to reduce 
the activity or level of a gene product required for proliferation, said gene product comprising a 
gene product whose activity or expression is inhibited by an antisense nucleic acid comprising a 
nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, said method 
comprising: 

(a) contacting a target gene or RNA encoding said gene product with a candidate 
compound or nucleic acid; and 

(b) measuring an activity of said target. 

58. The method of Paragraph 57, wherein said target gene or RNA is from an organism 
selected from the group consisting of 
Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, 
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida 
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida 
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida 
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis, 
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus 
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, 
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis and any species falling within the genera of any of the above species. 

59. The method of Paragraph 57, wherein said target gene or RNA is from an organism 
other than E. coli. 

60. The method of Paragraph 57, wherein said gene product is from an organism other 
than E. coli. 

61. The method of Paragraph 57, wherein said target is a messenger RNA molecule 
and said activity is translation of said messenger RNA. 

62. The method of Paragraph 57, wherein said target is a messenger RNA molecule 
and said activity is transcription of a gene encoding said messenger RNA. 

63. The method of Paragraph 57, wherein said target is a gene and said activity is 
transcription of said gene. 

64. The method of Paragraph 57, wherein said target is a nontranslated RNA and said 
activity is processing or folding of said nontranslated RNA or assembly of said nontranslated RNA 
into a protein/RNA complex. 

65. The method of Paragraph 57, wherein said target is a messenger RNA molecule 
encoding a polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NOs.: 42398-78581. 

66. The method of Paragraph 57, wherein said target comprises a nucleic acid selected 
from the group consisting of SEQ ID NOS.: 6214-42397. 

67. A compound or nucleic acid identified using the method of Paragraph 57. 

68. A method for identifying a compound which reduces the activity or level of a gene 
product required for proliferation of a cell, wherein the activity or expression of said gene product 
is inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213, said method comprising the steps of: 

(a) providing a sublethal level of an antisense nucleic acid comprising a nucleotide 
sequence complementary to a nucleic acid comprising a nucleotide sequence encoding said 
gene product in a cell to reduce the activity or amount of said gene product in said cell, 
thereby producing a sensitized cell; 

(b) contacting said sensitized cell with a compound; and 

(c) determining the degree to which said compound inhibits proliferation of said 
sensitized cell relative to a cell which does not contain said antisense nucleic acid. 

69. The method of Paragraph 68, wherein said determining step comprises determining 
whether said compound inhibits the growth of said sensitized cell to a greater extent than said 
compound inhibits the growth of a nonsensitized cell. 

70. The method of Paragraph 68, wherein said cell is a Gram positive bacterium. 

71. The method of Paragraph 68, wherein said Gram positive bacterium is selected 
from the group consisting of Staphylococcus species, Streptococcus species, Enterococcus species, 
Mycobacterium species, Clostridium species, and Bacillus species. 

72. The method of Paragraph 68, wherein said bacterium is Staphylococcus aureus. 

73. The method of Paragraph 72, wherein said Staphylococcus species is coagulase 
negative. 

74. The method of Paragraph 72, wherein said bacterium is selected from the group 
consisting of Staphylococcus aureus RN450 and Staphylococcus aureus RN4220. 

75. The method of Paragraph 68, wherein said cell is an organism selected from the 
group consisting of 
Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, 
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida 
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida 
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida 
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis, 
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus 
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, 
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis and any species falling within the genera of any of the above species. 

76. The method of Paragraph 68, wherein said cell is not an E. coli cell. 

77. The method of Paragraph 68, wherein said gene product is from an organism other 
than E. coli. 

78. The method of Paragraph 68, wherein said antisense nucleic acid is transcribed 
from an inducible promoter. 

79. The method of Paragraph 68, further comprising the step of contacting said cell 
with a concentration of inducer which induces transcription of said antisense nucleic acid to a 
sublethal level. 

80. The method of Paragraph 68, wherein growth inhibition is measured by monitoring 
optical density of a culture growth solution. 

81. The method of Paragraph 68, wherein said gene product is a polypeptide. 

82. The method of Paragraph 81, wherein said polypeptide comprises an amino acid 
sequence selected from the group consisting of SEQ ID NOs.: 42398-78581. 

83. The method of Paragraph 68, wherein said gene product is an RNA. 

84. The method of Paragraph 68, wherein nucleic acid encoding said gene product 
comprises a nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214-42397. 

85. A compound identified using the method of Paragraph 68. 

86. A method for inhibiting cellular proliferation comprising introducing an effective 
amount of a compound with activity against a gene whose activity or expression is inhibited by an 
antisense nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ 
ID NOs.: 1-6213 or a compound with activity against the product of said gene into a population of 
cells expressing said gene. 

87. The method of Paragraph 86, wherein said compound is an antisense nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, or a 
proliferation-inhibiting portion thereof. 

88. The method of Paragraph 86, wherein said proliferation inhibiting portion of one of 
SEQ ID NOs.: 1-6213 is a fragment comprising at least 10, at least 20, at least 25, at least 30, at 
least 50 or more than 51 consecutive nucleotides of one of SEQ ID NOs.: 1-6213. 

89. The method of Paragraph 86, wherein said population is a population of Gram 
positive bacteria. 

90. The method of Paragraph 89, wherein said population of Gram positive bacteria is 
selected from the group consisting of a population of Staphylococcus species, Streptococcus 
species, Enterococcus species, Mycobacterium species, Clostridium species, and Bacillus species. 

91. The method of Paragraph 86, wherein said population is a population of 
Staphylococcus aureus. 

92. The method of Paragraph 91, wherein said population is a population of a 
bacterium selected from the group consisting of Staphylococcus aureus RN450 and Staphylococcus 
aureus RN4220. 

93. The method of Paragraph 86, wherein said population is a population of a 
bacterium selected from the group consisting of 
Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, 
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida 
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida 
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida 
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis, 
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus 
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, 
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis and any species falling within the genera of any of the above species. 

94. The method of Paragraph 86, wherein said population is a population of an 
organism other than E. coli. 

95. The method of Paragraph 86, wherein said product of said gene is from an 
organism other than E. coli. 

96. The method of Paragraph 86, wherein said gene encodes a polypeptide comprising 
an amino acid sequence selected from the group consisting of SEQ ID NOs.: 42398-78581. 

97. The method of Paragraph 86, wherein said gene comprises a nucleotide sequence 
selected from the group consisting of SEQ ID NOS.: 6214-42397. 

98. A composition comprising an effective concentration of an antisense nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, or a 
proliferation-inhibiting portion thereof in a pharmaceutically acceptable carrier. 

99. The composition of Paragraph 98, wherein said proliferation-inhibiting portion of 
one of SEQ ID NOs.: 1-6213 comprises at least 20, at least 25, at least 30, at least 50 or more than 
50 consecutive nucleotides of one of SEQ ID NOs.: 1-6213. 

100. A method for inhibiting the activity or expression of a gene in an operon required 
for proliferation wherein the activity or expression of at least one gene in said operon is inhibited by 
an antisense nucleic acid comprising a sequence selected from the group consisting of SEQ ID 
NOs.: 1-6213, said method comprising contacting a cell in a cell population with an antisense 
nucleic acid complementary to at least a portion of said operon. 

101. The method of Paragraph 100, wherein said antisense nucleic acid comprises a 
nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213 or a 
proliferation-inhibiting portion thereof. 

102. The method of Paragraph 100, wherein said cell is selected from the group 
consisting of 
Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, 
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida 
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida 
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida 
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis, 
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus 
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, 
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis and any species falling within the genera of any of the above species. 

103. The method of Paragraph 100, wherein said cell is not an E. coli cell. 

104. The method of Paragraph 100, wherein said gene is from an organism other than E.
coli.

105. The method of Paragraph 100, wherein said cell is contacted with said antisense
nucleic acid by introducing a plasmid which expresses said antisense nucleic acid into said cell
population.

106. The method of Paragraph 100, wherein said cell is contacted with said antisense
nucleic acid by introducing a phage which encodes said antisense nucleic acid into said cell
population.

107. The method of Paragraph 100, wherein said cell is contacted with said antisense 
nucleic acid by expressing said antisense nucleic acid from the chromosome of cells in said cell 
population. 

108. The method of Paragraph 100, wherein said cell is contacted with said antisense
nucleic acid by introducing a promoter adjacent to a chromosomal copy of said antisense nucleic
acid such that said promoter directs the transcription of said antisense nucleic acid.

109. The method of Paragraph 100, wherein said cell is contacted with said antisense 
nucleic acid by introducing a retron which expresses said antisense nucleic acid into said cell 
population. 

110. The method of Paragraph 100, wherein said cell is contacted with said antisense
nucleic acid by introducing a ribozyme into said cell population, wherein a binding portion of said
ribozyme comprises said antisense nucleic acid.

111. The method of Paragraph 100, wherein said cell is contacted with said antisense
nucleic acid by introducing a liposome comprising said antisense nucleic acid into said cell.

112. The method of Paragraph 100, wherein said cell is contacted with said antisense
nucleic acid by electroporation of said antisense nucleic acid into said cell.

113. The method of Paragraph 100, wherein said antisense nucleic acid is a fragment
comprising at least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive
nucleotides of one of SEQ ID NOs.: 1-6213.
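
By way of illustration only, the fragment-length condition of Paragraph 113 can be checked computationally. The following Python sketch tests whether a candidate oligonucleotide shares at least a given number of consecutive nucleotides with a reference sequence. The function names `longest_shared_run` and `meets_fragment_threshold` and the example sequences are hypothetical; they are not taken from the Sequence Listing.

```python
def longest_shared_run(candidate: str, reference: str) -> int:
    """Length of the longest stretch of consecutive nucleotides of
    `candidate` that also occurs contiguously in `reference`."""
    candidate, reference = candidate.upper(), reference.upper()
    best = 0
    # prev[j]: length of the shared run ending at candidate[i-2], reference[j-1]
    prev = [0] * (len(reference) + 1)
    for i in range(1, len(candidate) + 1):
        cur = [0] * (len(reference) + 1)
        for j in range(1, len(reference) + 1):
            if candidate[i - 1] == reference[j - 1]:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def meets_fragment_threshold(candidate: str, reference: str, min_len: int = 10) -> bool:
    """True if the candidate contains at least `min_len` consecutive
    nucleotides of the reference (min_len is an illustrative default)."""
    return longest_shared_run(candidate, reference) >= min_len


# Hypothetical sequences, for illustration only.
reference = "ATGCGTACGTTAGCCTAGGA"
candidate = "TTTCGTACGTTAGCAAA"
print(longest_shared_run(candidate, reference))
print(meets_fragment_threshold(candidate, reference, min_len=10))
```

The thresholds of 10, 20, 25, 30 and 50 recited above can each be passed as `min_len`.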

114. The method of Paragraph 100, wherein said antisense nucleic acid is a synthetic
oligonucleotide.

115. The method of Paragraph 100, wherein said gene comprises a nucleotide sequence 
selected from the group consisting of SEQ ID NOS.: 6214-42397. 

116. A method for identifying a gene which is required for proliferation of a cell
comprising:

(a) contacting a cell with an antisense nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, wherein said cell is a
cell other than the organism from which said nucleic acid was obtained;

(b) determining whether said nucleic acid inhibits proliferation of said cell; and

(c) identifying the gene in said cell which encodes the mRNA which is
complementary to said antisense nucleic acid or a portion thereof.
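
Step (c) above relies on complementarity between the antisense nucleic acid and an mRNA. As a minimal sketch, assuming exact complementarity and DNA-alphabet sense-strand sequences, the Python code below finds which genes in a collection contain the reverse complement of an antisense sequence. The gene names and sequences are hypothetical placeholders, and an actual search would more likely use an alignment tool that tolerates mismatches.

```python
# Translation table for complementing DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def genes_targeted(antisense: str, genes: dict) -> list:
    """Names of genes whose sense strand contains the exact reverse
    complement of `antisense` (exact matching is an assumption)."""
    target = reverse_complement(antisense).upper()
    return [name for name, sense in genes.items() if target in sense.upper()]


# Hypothetical sense-strand sequences, for illustration only.
genes = {
    "geneA": "ATGGCTAGCTAGGATCCA",
    "geneB": "ATGTTTCCCGGGAAATAG",
}
print(genes_targeted("CCTAGCTAGC", genes))
```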

117. The method of Paragraph 116, wherein said cell is selected from the group
consisting of Staphylococcus species, Streptococcus species, Enterococcus species, Mycobacterium
species, Clostridium species, and Bacillus species.

118. The method of Paragraph 116, wherein said cell is selected from the group
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

119. The method of Paragraph 116, wherein said cell is not E. coli.

120. The method of Paragraph 116, further comprising operably linking said antisense
nucleic acid to a promoter which is functional in said cell, said promoter being included in a vector,
and introducing said vector into said cell.

121. A method for identifying a compound having the ability to inhibit proliferation of a
cell comprising:

(a) identifying a homolog of a gene or gene product whose activity or level is
inhibited by a nucleic acid comprising a nucleotide sequence selected from the group
consisting of SEQ ID NOs. 1-6213 in a test cell, wherein said test cell is not the cell from
which said nucleic acid was obtained;

(b) identifying an inhibitory nucleic acid sequence which inhibits the activity of
said homolog in said test cell;

(c) contacting said test cell with a sublethal level of said inhibitory nucleic acid,
thus sensitizing said cell;

(d) contacting the sensitized cell of step (c) with a compound; and

(e) determining the degree to which said compound inhibits proliferation of said
sensitized cell relative to a cell which does not contain said inhibitory nucleic acid.

122. The method of Paragraph 121, wherein said determining step comprises
determining whether said compound inhibits proliferation of said sensitized test cell to a greater
extent than said compound inhibits proliferation of a nonsensitized test cell.
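
The comparison recited in Paragraph 122 can be sketched numerically. The Python example below is an illustration only: it assumes that proliferation is summarized by a single growth measurement per culture (for example, a final culture density), and the function names, the `margin` parameter, and the sample values are hypothetical.

```python
def fractional_inhibition(growth_with_compound: float, growth_without_compound: float) -> float:
    """Fraction by which the compound reduces growth (0 = none, 1 = complete)."""
    if growth_without_compound <= 0:
        raise ValueError("untreated growth must be positive")
    return 1.0 - growth_with_compound / growth_without_compound


def is_more_inhibited_when_sensitized(sens_treated: float, sens_untreated: float,
                                      ctrl_treated: float, ctrl_untreated: float,
                                      margin: float = 0.0) -> bool:
    """True if the compound inhibits the sensitized cell to a greater extent
    than the nonsensitized cell. `margin` is an assumed tolerance, not a
    value given in the text."""
    inh_sensitized = fractional_inhibition(sens_treated, sens_untreated)
    inh_control = fractional_inhibition(ctrl_treated, ctrl_untreated)
    return inh_sensitized - inh_control > margin


# Hypothetical growth readings, for illustration only.
print(is_more_inhibited_when_sensitized(0.2, 0.8, 0.9, 1.0))
```

Normalizing each treated culture to its own untreated counterpart separates the effect of the compound from the growth reduction caused by the sublethal level of inhibitory nucleic acid itself.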

123. The method of Paragraph 121, wherein step (a) comprises identifying a nucleic acid
homologous to a gene or gene product whose activity or level is inhibited by a nucleic acid selected
from the group consisting of SEQ ID NOs. 1-6213 or a nucleic acid encoding a homologous
polypeptide to a polypeptide whose activity or level is inhibited by a nucleic acid selected from the
group consisting of SEQ ID NOs. 1-6213 by using an algorithm selected from the group consisting
of BLASTN version 2.0 with the default parameters and FASTA version 3.0t78 algorithm with the
default parameters to identify said homologous nucleic acid or said nucleic acid encoding a
homologous polypeptide in a database.

124. The method of Paragraph 121, wherein said step (a) comprises identifying a
homologous nucleic acid or a nucleic acid comprising a sequence of nucleotides encoding a
homologous polypeptide by identifying nucleic acids which hybridize to said nucleic acid selected
from the group consisting of SEQ ID NOs. 1-6213 or the complement of said nucleic acid selected
from the group consisting of SEQ ID NOs. 1-6213.

125. The method of Paragraph 121, wherein step (a) comprises expressing a nucleic acid
selected from the group consisting of SEQ ID NOs. 1-6213 in said test cell.

126. The method of Paragraph 121, wherein step (a) comprises identifying a
homologous nucleic acid or a nucleic acid encoding a homologous polypeptide in a test cell
selected from the group consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus
fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi,
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni,
Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida
parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida
pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis,
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium perfringens,
Coccidioides immitis, Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter
cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae,
Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila,
Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

127. The method of Paragraph 121, wherein step (a) comprises identifying a
homologous nucleic acid or a nucleic acid encoding a homologous polypeptide in a test cell other
than E. coli.

128. The method of Paragraph 121, wherein said inhibitory nucleic acid is an antisense
nucleic acid.

129. The method of Paragraph 121, wherein said inhibitory nucleic acid comprises an 
antisense nucleic acid to a portion of said homolog. 

130. The method of Paragraph 121, wherein said inhibitory nucleic acid comprises an
antisense nucleic acid to a portion of the operon encoding said homolog.

131. The method of Paragraph 121, wherein the step of contacting the cell with a 
sublethal level of said inhibitory nucleic acid comprises directly contacting the surface of said cell 
with said inhibitory nucleic acid. 

132. The method of Paragraph 121, wherein the step of contacting the cell with a
sublethal level of said inhibitory nucleic acid comprises transcribing an antisense nucleic acid
complementary to at least a portion of the RNA transcribed from said homolog in said cell.

133. The method of Paragraph 121, wherein said gene product comprises a polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID NOs.: 42398- 
78581. 

134. The method of Paragraph 121, wherein said gene comprises a nucleotide sequence
selected from the group consisting of SEQ ID NOS.: 6214-42397.

135. A compound identified using the method of Paragraph 121.

136. A method of identifying a compound having the ability to inhibit proliferation
comprising:

(a) contacting a test cell with a sublethal level of a nucleic acid comprising a
nucleotide sequence selected from the group consisting of SEQ ID NOs. 1-6213 or a
portion thereof which inhibits the proliferation of the cell from which said nucleic acid was
obtained, thus sensitizing said test cell;

(b) contacting the sensitized test cell of step (a) with a compound; and

(c) determining the degree to which said compound inhibits proliferation of said
sensitized test cell relative to a cell which does not contain said nucleic acid.

137. The method of Paragraph 136, wherein said determining step comprises
determining whether said compound inhibits proliferation of said sensitized test cell to a greater
extent than said compound inhibits proliferation of a nonsensitized test cell.

138. A compound identified using the method of Paragraph 136. 

139. The method of Paragraph 136, wherein said test cell is selected from the group
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

140. The method of Paragraph 136, wherein the test cell is not E. coli. 

141. A method for identifying a compound having activity against a biological pathway
required for proliferation comprising:

(a) sensitizing a cell by providing a sublethal level of an antisense nucleic acid
complementary to a nucleic acid encoding a gene product required for proliferation,
wherein the activity or expression of said gene product is inhibited by an antisense nucleic
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID
NOs.: 1-6213, in said cell to reduce the activity or amount of said gene product;

(b) contacting the sensitized cell with a compound; and

(c) determining the degree to which said compound inhibits the growth of said
sensitized cell relative to a cell which does not contain said antisense nucleic acid.

142. The method of Paragraph 141, wherein said determining step comprises
determining whether said compound inhibits the growth of said sensitized cell to a greater extent
than said compound inhibits the growth of a nonsensitized cell.

143. The method of Paragraph 141, wherein said cell is selected from the group 
consisting of bacterial cells, fungal cells, plant cells, and animal cells. 

144. The method of Paragraph 141, wherein said cell is a Gram positive bacterium. 

145. The method of Paragraph 144, wherein said Gram positive bacterium is selected
from the group consisting of Staphylococcus species, Streptococcus species, Enterococcus species,
Mycobacterium species, Clostridium species, and Bacillus species.

146. The method of Paragraph 145, wherein said Gram positive bacterium is
Staphylococcus aureus.

147. The method of Paragraph 146, wherein said Gram positive bacterium is selected
from the group consisting of Staphylococcus aureus RN450 and Staphylococcus aureus RN4220.

148. The method of Paragraph 141, wherein said cell is selected from the group
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

149. The method of Paragraph 141, wherein said cell is not an E. coli cell. 

150. The method of Paragraph 141, wherein said gene product is from an organism other
than E. coli.

151. The method of Paragraph 141, wherein said antisense nucleic acid is transcribed
from an inducible promoter.

152. The method of Paragraph 141, further comprising contacting the cell with an agent 
which induces transcription of said antisense nucleic acid from said inducible promoter, wherein 
said antisense nucleic acid is transcribed at a sublethal level. 

153. The method of Paragraph 141, wherein inhibition of proliferation is measured by
monitoring the optical density of a liquid culture.
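
As a rough illustration of how optical density readings might be reduced to a measure of inhibition, the Python sketch below fits the slope of ln(OD) against time as a growth rate and expresses inhibition as the percent reduction in that rate. The log-linear fit, the function names, and the sample readings are assumptions made for this example and are not specified in the text.

```python
import math


def growth_rate(times_h, od_values):
    """Least-squares slope of ln(OD) versus time, in units of 1/hour.
    Assumes the readings fall within exponential growth."""
    if len(times_h) != len(od_values) or len(times_h) < 2:
        raise ValueError("need at least two paired time/OD readings")
    if any(od <= 0 for od in od_values):
        raise ValueError("OD readings must be positive")
    logs = [math.log(od) for od in od_values]
    n = len(times_h)
    mean_t = sum(times_h) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times_h)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_h, logs))
    return sxy / sxx


def percent_inhibition(rate_treated, rate_control):
    """Percent reduction in growth rate relative to the control culture."""
    return 100.0 * (1.0 - rate_treated / rate_control)


# Hypothetical readings: the control doubles every hour, the treated
# culture every two hours.
times = [0, 1, 2, 3]
control = [0.05 * 2 ** t for t in times]
treated = [0.05 * 2 ** (t / 2) for t in times]
print(percent_inhibition(growth_rate(times, treated), growth_rate(times, control)))
```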

154. The method of Paragraph 141, wherein said gene product comprises a polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID NOs.: 42398- 
78581. 

155. The method of Paragraph 141, wherein said nucleic acid encoding said gene
product comprises a nucleotide sequence selected from the group consisting of SEQ ID NOS.:
6214-42397.

156. A compound identified using the method of Paragraph 141. 

157. A method for identifying a compound having the ability to inhibit cellular 
proliferation comprising: 

(a) contacting a cell with an agent which reduces the activity or level of a gene
product required for proliferation of said cell, wherein said gene product is a gene product
whose activity or expression is inhibited by an antisense nucleic acid comprising a
nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213;

(b) contacting said cell with a compound; and

(c) determining whether said compound reduces proliferation of said contacted cell
by acting on said gene product.

158. The method of Paragraph 157, wherein said determining step comprises
determining whether said compound reduces proliferation of said contacted cell to a greater extent
than said compound reduces proliferation of cells which have not been contacted with said agent.

159. The method of Paragraph 157, wherein said cell is selected from the group
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

160. The method of Paragraph 157, wherein said cell is not an E. coli cell.

161. The method of Paragraph 157, wherein said gene product is from an organism other
than E. coli.

162. The method of Paragraph 157, wherein said agent which reduces the activity or
level of a gene product required for proliferation of said cell comprises an antisense nucleic acid to
a gene or operon required for proliferation.

163. The method of Paragraph 157, wherein said agent which reduces the activity or
level of a gene product required for proliferation of said cell comprises a compound known to
inhibit growth or proliferation of a cell.

164. The method of Paragraph 157, wherein said cell contains a mutation which reduces
the activity or level of said gene product required for proliferation of said cell.

165. The method of Paragraph 157, wherein said mutation is a temperature sensitive
mutation.

166. The method of Paragraph 157, wherein said gene product comprises a polypeptide
comprising an amino acid sequence selected from the group consisting of SEQ ID NOs.: 42398-
78581.

167. A compound identified using the method of Paragraph 157.

168. A method for identifying the biological pathway in which a proliferation-required
gene or its gene product lies, wherein said gene or gene product comprises a gene or gene product
whose activity or expression is inhibited by an antisense nucleic acid comprising a sequence
selected from the group consisting of SEQ ID NOs.: 1-6213, said method comprising:

(a) providing a sublethal level of an antisense nucleic acid which inhibits the
activity of said proliferation-required gene or gene product in a test cell;

(b) contacting said test cell with a compound known to inhibit growth or
proliferation of a cell, wherein the biological pathway on which said compound acts is
known; and

(c) determining the degree to which said proliferation of said test cell is inhibited
relative to a cell which was not contacted with said compound.

169. The method of Paragraph 168, wherein said determining step comprises
determining whether said test cell has a substantially greater sensitivity to said compound than a
cell which does not express said sublethal level of said antisense nucleic acid.

170. The method of Paragraph 168, wherein said gene product comprises a polypeptide
comprising an amino acid sequence selected from the group consisting of SEQ ID NOs.: 42398-
78581.

171. The method of Paragraph 168, wherein said test cell is selected from the group
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

172. The method of Paragraph 168, wherein said test cell is not an E. coli cell.

173. The method of Paragraph 168, wherein said gene product is from an organism other
than E. coli.

174. A method for determining the biological pathway on which a test compound acts 
comprising: 

(a) providing a sublethal level of an antisense nucleic acid complementary to a 
proliferation-required nucleic acid in a first cell, wherein the activity or expression of said 
proliferation-required nucleic acid is inhibited by an antisense nucleic acid comprising a 
sequence selected from the group consisting of SEQ ID NOs.: 1-6213 and wherein the 
biological pathway in which said proliferation-required nucleic acid or a protein encoded 
by said proliferation-required nucleic acid lies is known;

(b) contacting said first cell with said test compound; and 

(c) determining the degree to which said test compound inhibits proliferation of 
said first cell relative to a cell which does not contain said antisense nucleic acid. 

175. The method of Paragraph 174, wherein said determining step comprises 
determining whether said first cell has a substantially greater sensitivity to said test compound than 
a cell which does not express said sublethal level of said antisense nucleic acid. 

176. The method of Paragraph 174, further comprising: 

(d) providing a sublethal level of a second antisense nucleic acid complementary to 
a second proliferation-required nucleic acid in a second cell, wherein said second 
proliferation-required nucleic acid is in a different biological pathway than said 
proliferation-required nucleic acid in step (a); and 

(e) determining whether said second cell does not have a substantially greater 
sensitivity to said test compound than a cell which does not express said sublethal level of 
said second antisense nucleic acid, wherein said test compound is specific for the biological 
pathway against which the antisense nucleic acid of step (a) acts if said first cell has a 
substantially greater sensitivity to said test compound than said second cell. 
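
The two-cell comparison of Paragraph 176 can be expressed as a simple decision rule. The Python sketch below is illustrative only: it assumes sensitivity is quantified as the ratio of a half-maximal inhibitory concentration (IC50) in the nonsensitized cell to that in the sensitized cell, and it uses a hypothetical fold-change threshold to stand in for "substantially greater". Neither the IC50 metric nor the threshold value is stated in the text.

```python
def fold_sensitization(ic50_control: float, ic50_sensitized: float) -> float:
    """How many times more sensitive the antisense-expressing cell is
    than its nonsensitized counterpart (assumed IC50-based metric)."""
    if ic50_control <= 0 or ic50_sensitized <= 0:
        raise ValueError("IC50 values must be positive")
    return ic50_control / ic50_sensitized


def pathway_specific(fold_first: float, fold_second: float, threshold: float = 4.0) -> bool:
    """True if the first cell is substantially sensitized and the second
    cell is not. `threshold` is a hypothetical cutoff."""
    return fold_first >= threshold and fold_second < threshold


# Hypothetical IC50 values in arbitrary concentration units.
first = fold_sensitization(ic50_control=8.0, ic50_sensitized=1.0)
second = fold_sensitization(ic50_control=8.0, ic50_sensitized=6.5)
print(pathway_specific(first, second))
```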

177. The method of Paragraph 174, wherein said first cell is selected from the group
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

178. The method of Paragraph 174, wherein said first cell is not an E. coli cell. 

179. The method of Paragraph 174, wherein said proliferation-required nucleic acid is 
from an organism other than E. coli. 

180. A purified or isolated nucleic acid comprising a sequence selected from the group
consisting of SEQ ID NOs.: 1-6213.

181. A compound which interacts with a gene or gene product whose activity or 
expression is inhibited by an antisense nucleic acid comprising a nucleotide sequence of one of 
SEQ ID NOs.: 1-6213 to inhibit proliferation.

182. The compound of Paragraph 181, wherein said gene product is a polypeptide 
comprising one of SEQ ID NOs.: 42398-78581.

183. The compound of Paragraph 181, wherein said gene comprises a nucleotide 
sequence selected from the group consisting of SEQ ID NOS.: 6214-42397. 

184. A compound which interacts with a gene product whose expression is inhibited by 
an antisense nucleic acid comprising a nucleotide sequence of one of SEQ ID NOs.: 1-6213 to
inhibit proliferation.

185. A method for manufacturing an antibiotic comprising the steps of:
screening one or more candidate compounds to identify a compound that reduces the
activity or level of a gene product required for proliferation, said gene product comprising a gene
product whose activity or expression is inhibited by an antisense nucleic acid comprising a
nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213; and
manufacturing the compound so identified.

186. The method of Paragraph 185, wherein said screening step comprises performing 
any one of the methods of Paragraphs 44, 68, 121, 136, 141, and 157. 

187. The method of Paragraph 185, wherein said gene product is a polypeptide 
comprising one of SEQ ID NOs.: 42398-78581.

188. A method for inhibiting proliferation of a cell in a subject comprising administering 
an effective amount of a compound that reduces the activity or level of a gene product required for 
proliferation of said cell, said gene product comprising a gene product whose activity or expression
is inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from the group
consisting of SEQ ID NOs.: 1-6213 to said subject.

189. The method of Paragraph 188, wherein said subject is selected from the group
consisting of vertebrates, mammals, avians, and human beings.

190. The method of Paragraph 188, wherein said gene product comprises a polypeptide
comprising an amino acid sequence selected from the group consisting of SEQ ID NOs.:
42398-78581.

191. The method of Paragraph 188, wherein said cell is selected from the group 
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

192. The method of Paragraph 188, wherein said cell is not E. coli.

193. The method of Paragraph 188, wherein said gene product is from an organism other 
than E. coli. 

194. A purified or isolated nucleic acid consisting essentially of the coding sequence of 
one of SEQ ID NOs.: 6214-42397.

195. A fragment of the nucleic acid of Paragraph 8, said fragment comprising at least
10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive nucleotides of one of
SEQ ID NOs: 6214-42397.

196. A purified or isolated nucleic acid comprising a nucleic acid having at least 70%
nucleotide sequence identity to a nucleotide sequence selected from the group consisting of SEQ ID
NOs.: 6214-42397, fragments comprising at least 25 consecutive nucleotides of SEQ ID NOs.:
6214-42397, the nucleotide sequences complementary to SEQ ID NOs.: 6214-42397, and the
nucleotide sequences complementary to fragments comprising at least 25 consecutive nucleotides
of SEQ ID NOs.: 6214-42397 as determined using BLASTN version 2.0 with the default
parameters.
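
By way of non-limiting illustration, the three sequence relationships recited in Paragraphs 195 and 196 (a complementary sequence, a percent nucleotide identity, and a run of at least 25 consecutive shared nucleotides) can be sketched in Python. This sketch is an assumption made for clarity only: it compares equal-length, already aligned sequences without gaps and is not a substitute for BLASTN version 2.0, which computes local gapped alignments. The function names `reverse_complement`, `percent_identity`, and `shares_fragment` are hypothetical and do not appear in the specification.

```python
# Illustrative only: a simplified stand-in for the comparisons named in the
# text. BLASTN performs local, gapped alignment; this sketch does not.

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, read 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def percent_identity(a: str, b: str) -> float:
    """Percent of matching positions in two equal-length, pre-aligned sequences."""
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def shares_fragment(query: str, reference: str, min_len: int = 25) -> bool:
    """True if query contains at least min_len consecutive nucleotides of reference."""
    query, reference = query.upper(), reference.upper()
    # Every window of length min_len found in the reference.
    windows = {
        reference[i:i + min_len]
        for i in range(len(reference) - min_len + 1)
    }
    return any(
        query[i:i + min_len] in windows
        for i in range(len(query) - min_len + 1)
    )
```

A sequence meeting the 70% threshold in this simplified measure would satisfy `percent_identity(a, b) >= 70.0`; the 25-nucleotide default of `shares_fragment` mirrors the fragment length recited in Paragraph 196 and can be changed to 10, 20, 30, or 50 for the lengths listed in Paragraph 195.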

197. The nucleic acid of Paragraph 196, wherein said nucleic acid is from an organism 
selected from the group consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus
fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi,
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni,
Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida
parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida
pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis,
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium perfringens,
Coccidioides immitis, Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter
cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae,
Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila,
Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

198. The nucleic acid of Paragraph 196, wherein said nucleic acid is from an organism
other than E. coli.

199. A method of inhibiting proliferation of a cell comprising inhibiting the activity or 
reducing the amount of a gene product in said cell or inhibiting the activity or reducing the amount 
of a nucleic acid encoding said gene product in said cell, wherein said gene product is selected from 
the group consisting of a gene product having at least 70% nucleotide sequence identity as
determined using BLASTN version 2.0 with the default parameters to a gene product whose
expression is inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from
the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid having at
least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the default
parameters to a nucleic acid encoding a gene product whose expression is inhibited by an antisense
nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ ID
NOs: 1-6213, a gene product having at least 25% amino acid identity as determined using FASTA
version 3.0t78 with the default parameters to a gene product whose expression is inhibited by an
antisense nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ
ID NOs.: 1-6213, a gene product encoded by a nucleic acid which hybridizes to a nucleic acid
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213
under stringent conditions, a gene product encoded by a nucleic acid which hybridizes to a nucleic
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213
under moderate conditions, and a gene product whose activity may be complemented by the gene
product whose activity is inhibited by a nucleic acid comprising a nucleotide sequence selected
from the group consisting of SEQ ID NOs: 1-6213.

200. The method of Paragraph 199, wherein said method comprises inhibiting said
activity or reducing said amount of said gene product or inhibiting the activity or reducing the
amount of a nucleic acid encoding said gene product in an organism selected from the group
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

201. The method of Paragraph 199, wherein said method comprises inhibiting said
activity or reducing said amount of said gene product or inhibiting the activity or reducing the
amount of a nucleic acid encoding said gene product in an organism other than E. coli.

202. The method of Paragraph 199, wherein said gene product is from an organism other
than E. coli.

203. The method of Paragraph 199, wherein said gene product comprises a polypeptide
selected from the group consisting of a polypeptide having at least 25% amino acid identity as
determined using FASTA version 3.0t78 to a polypeptide selected from the group consisting of
SEQ ID NOs.: 42398-78581 and a polypeptide whose activity may be complemented by a
polypeptide selected from the group consisting of SEQ ID NOs: 42398-78581.

204. The method of Paragraph 199, wherein said gene product is encoded by a nucleic
acid selected from the group consisting of a nucleic acid comprising a nucleic acid having at least
70% nucleotide sequence identity as determined using BLASTN version 2.0 with the default
parameters to a nucleotide sequence selected from the group consisting of SEQ ID NOS.:
6214-42397, a nucleic acid comprising a nucleotide sequence which hybridizes to a sequence selected
from the group consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic
acid comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected from the
group consisting of SEQ ID NOS.: 6214-42397 under moderate conditions.

205. A method for identifying a compound which influences the activity of a gene
product required for proliferation comprising:

contacting a candidate compound with a gene product selected from the group
consisting of a gene product having at least 70% nucleotide sequence identity as determined
using BLASTN version 2.0 with the default parameters to a gene product whose expression
is inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from the
group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid having
at least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with
the default parameters to a nucleic acid encoding a gene product whose expression is
inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from the
group consisting of SEQ ID NOs: 1-6213, a gene product having at least 25% amino acid
identity as determined using FASTA version 3.0t78 with the default parameters to a gene
product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product
encoded by a nucleic acid comprising a nucleotide sequence which hybridizes to a nucleic
acid selected from the group consisting of SEQ ID NOs.: 1-6213 under stringent
conditions, a gene product encoded by a nucleic acid comprising a nucleotide sequence
which hybridizes to a nucleic acid selected from the group consisting of SEQ ID NOs.:
1-6213 under moderate conditions, and a gene product whose activity may be complemented
by the gene product whose activity is inhibited by a nucleic acid selected from the group
consisting of SEQ ID NOs: 1-6213; and

determining whether said candidate compound influences the activity of said gene
product.

206. The method of Paragraph 205, wherein said gene product is from an organism
selected from the group consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus
fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi,
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni,
Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida
parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida
pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis,
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium perfringens,
Coccidioides immitis, Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter
cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae,
Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila,
Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

207. The method of Paragraph 205, wherein said gene product is from an organism other 
than E. coli. 

208. The method of Paragraph 205, wherein said gene product is a polypeptide selected
from the group consisting of a polypeptide having at least 25% amino acid identity as determined
using FASTA version 3.0t78 to a polypeptide selected from the group consisting of SEQ ID NOs.:
42398-78581 and a polypeptide whose activity may be complemented by a polypeptide selected
from the group consisting of SEQ ID NOs: 42398-78581.

209. The method of Paragraph 205, wherein said gene product is encoded by a nucleic
acid selected from the group consisting of a nucleic acid comprising a nucleic acid having at least
70% nucleotide sequence identity as determined using BLASTN version 2.0 with the default
parameters to a nucleotide sequence selected from the group consisting of SEQ ID NOS.:
6214-42397, a nucleic acid which hybridizes to a sequence selected from the group consisting of SEQ ID
NOS.: 6214-42397 under stringent conditions, and a nucleic acid which hybridizes to a sequence
selected from the group consisting of SEQ ID NOS.: 6214-42397 under moderate conditions.

210. A compound identified using the method of Paragraph 205. 

211. A method for identifying a compound or nucleic acid having the ability to reduce
the activity or level of a gene product required for proliferation comprising:

(a) providing a target that is a gene or RNA, wherein said target comprises a
nucleic acid that encodes a gene product selected from the group consisting of a gene
product having at least 70% nucleotide sequence identity as determined using
BLASTN version 2.0 with the default parameters to a gene product whose expression is
inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from the
group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid having
at least 70% nucleic acid identity as determined using BLASTN version 2.0 with the
default parameters to a nucleic acid encoding a gene product whose expression is inhibited
by an antisense nucleic acid comprising a nucleotide sequence selected from the group
consisting of SEQ ID NOs: 1-6213, a gene product having at least 25% amino acid identity
as determined using FASTA version 3.0t78 with the default parameters to a gene product
whose expression is inhibited by an antisense nucleic acid comprising a sequence selected
from the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic
acid comprising a nucleotide sequence which hybridizes to a nucleic acid selected from the
group consisting of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product
encoded by a nucleic acid comprising a nucleotide sequence which hybridizes to a nucleic
acid selected from the group consisting of SEQ ID NOs.: 1-6213 under moderate
conditions, and a gene product whose activity may be complemented by the gene product
whose activity is inhibited by a nucleic acid selected from the group consisting of SEQ ID
NOs: 1-6213;

(b) contacting said target with a candidate compound or nucleic acid; and

(c) measuring an activity of said target.

212. The method of Paragraph 211, wherein said target gene or RNA is from an
organism selected from the group consisting of Acinetobacter baumannii, Anaplasma marginale,
Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia
burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter
jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis,
Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida
pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis,
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium perfringens,
Coccidioides immitis, Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter
cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae,
Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila,
Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

213. The method of Paragraph 211, wherein said target gene or RNA is from an 
organism other than E. coli. 

214. The method of Paragraph 211, wherein said gene product is from an organism other
than E. coli.

215. The method of Paragraph 211, wherein said target is a messenger RNA molecule 
and said activity is translation of said messenger RNA. 

216. The method of Paragraph 211, wherein said compound is a nucleic acid and said 
activity is translation of said gene product. 

217. The method of Paragraph 211, wherein said target is a gene and said activity is
transcription of said gene.

218. The method of Paragraph 211, wherein said target is a nontranslated RNA and said 
activity is processing or folding of said nontranslated RNA or assembly of said nontranslated RNA 
into a protein/RNA complex. 

219. The method of Paragraph 211, wherein said target gene is a messenger RNA
molecule encoding a polypeptide selected from the group consisting of a polypeptide having at least
25% amino acid identity as determined using FASTA version 3.0t78 to a polypeptide selected from
the group consisting of SEQ ID NOs.: 42398-78581 and a polypeptide whose activity may be
complemented by a polypeptide selected from the group consisting of SEQ ID NOs: 42398-78581.

220. The method of Paragraph 211, wherein said target gene comprises a nucleic acid
selected from the group consisting of a nucleic acid comprising a nucleic acid having at least 70%
nucleotide sequence identity as determined using BLASTN version 2.0 with the default parameters
to a nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214-42397, a
nucleic acid which hybridizes to a sequence selected from the group consisting of SEQ ID NOS.:
6214-42397 under stringent conditions, and a nucleic acid which hybridizes to a sequence selected
from the group consisting of SEQ ID NOS.: 6214-42397 under moderate conditions.

221. A compound or nucleic acid identified using the method of Paragraph 211.

222. A method for identifying a compound which reduces the activity or level of a gene
product required for proliferation of a cell comprising:

(a) providing a sublethal level of an antisense nucleic acid complementary to a
nucleic acid encoding said gene product in a cell to reduce the activity or amount of said
gene product in said cell, thereby producing a sensitized cell, wherein said gene product is
selected from the group consisting of a gene product having at least 70% nucleic
acid identity as determined using BLASTN version 2.0 with the default parameters to a
gene product whose expression is inhibited by an antisense nucleic acid comprising a
nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene
product encoded by a nucleic acid having at least 70% nucleotide sequence identity as
determined using BLASTN version 2.0 with the default parameters to a nucleic acid
encoding a gene product whose expression is inhibited by an antisense nucleic acid
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs:
1-6213, a gene product having at least 25% amino acid identity as determined using FASTA
version 3.0t78 with the default parameters to a gene product whose expression is inhibited
by an antisense nucleic acid comprising a nucleotide sequence selected from the group
consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid comprising a
nucleotide sequence which hybridizes to a nucleic acid selected from the group consisting
of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by a nucleic
acid comprising a nucleotide sequence which hybridizes to a nucleic acid comprising a
nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213 under
moderate conditions, and a gene product whose activity may be complemented by the gene
product whose activity is inhibited by a nucleic acid selected from the group consisting of
SEQ ID NOs: 1-6213;

(b) contacting said sensitized cell with a compound; and

(c) determining the degree to which said compound inhibits the growth of said
sensitized cell relative to a cell which does not contain said antisense nucleic acid.

223. The method of Paragraph 222, wherein said determining step comprises
determining whether said compound inhibits the growth of said sensitized cell to a greater extent
than said compound inhibits the growth of a nonsensitized cell.
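
By way of non-limiting illustration, the comparison recited in Paragraphs 222(c) and 223 can be sketched in Python as follows. The sketch assumes that growth is read as a single optical density value per culture and that a compound is flagged when it inhibits the sensitized cell by some margin more than the nonsensitized cell. The function names and the 20-point default margin `min_difference` are hypothetical choices made for this example and are not values taken from the specification.

```python
def percent_inhibition(od_treated: float, od_untreated: float) -> float:
    """Growth inhibition (%) of a treated culture relative to its untreated control."""
    if od_untreated <= 0:
        raise ValueError("untreated optical density must be positive")
    return 100.0 * (1.0 - od_treated / od_untreated)


def inhibits_sensitized_more(
    sens_treated: float,
    sens_untreated: float,
    ctrl_treated: float,
    ctrl_untreated: float,
    min_difference: float = 20.0,  # assumed margin, in percentage points
) -> bool:
    """True if the compound inhibits the sensitized cell more than the
    nonsensitized cell by at least min_difference percentage points."""
    sensitized = percent_inhibition(sens_treated, sens_untreated)
    nonsensitized = percent_inhibition(ctrl_treated, ctrl_untreated)
    return (sensitized - nonsensitized) >= min_difference
```

Each cell type is normalized to its own untreated culture so that the growth defect caused by the antisense nucleic acid alone is not mistaken for an effect of the compound.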

224. The method of Paragraph 222, wherein said sensitized cell is a Gram positive 
bacterium. 

225. The method of Paragraph 224, wherein said Gram positive bacterium is selected
from the group consisting of Staphylococcus species, Streptococcus species, Enterococcus species,
Mycobacterium species, Clostridium species, and Bacillus species.

226. The method of Paragraph 225, wherein said bacterium is Staphylococcus aureus. 

227. The method of Paragraph 224, wherein said Staphylococcus species is coagulase
negative.

228. The method of Paragraph 226, wherein said bacterium is selected from the group
consisting of Staphylococcus aureus RN450 and Staphylococcus aureus RN4220.

229. The method of Paragraph 222, wherein said sensitized cell is an organism selected
from the group consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus
fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi,
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni,
Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida
parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida
pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis,
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium perfringens,
Coccidioides immitis, Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter
cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae,
Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila,
Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

230. The method of Paragraph 222, wherein said cell is an organism other than E. coli.

231. The method of Paragraph 222, wherein said gene product is from an organism other
than E. coli.

232. The method of Paragraph 222, wherein said antisense nucleic acid is transcribed 
from an inducible promoter. 

233. The method of Paragraph 222, further comprising the step of contacting said cell
with a concentration of inducer which induces transcription of said antisense nucleic acid to a
sublethal level.
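
By way of non-limiting illustration, one way to choose the inducer concentration of Paragraph 233 from a titration series is sketched below in Python. The sketch assumes that each inducer concentration has one optical density reading, that the reading at concentration 0.0 is the uninduced reference, and that "sublethal" is approximated as growth remaining at or above a chosen fraction of that reference. The function name `pick_sublethal_inducer` and the default fraction of 0.5 are hypothetical and are not taken from the specification.

```python
from typing import Mapping, Optional


def pick_sublethal_inducer(
    od_by_concentration: Mapping[float, float],
    min_relative_growth: float = 0.5,  # assumed cutoff for "sublethal"
) -> Optional[float]:
    """Return the highest inducer concentration at which growth, relative to
    the uninduced culture (key 0.0), stays at or above min_relative_growth.
    Returns None if no induced culture meets the cutoff."""
    baseline = od_by_concentration[0.0]
    if baseline <= 0:
        raise ValueError("uninduced optical density must be positive")
    tolerated = [
        conc
        for conc, od in od_by_concentration.items()
        if conc > 0 and od / baseline >= min_relative_growth
    ]
    return max(tolerated) if tolerated else None
```

For example, with readings of 1.0, 0.9, 0.6, 0.3 and 0.05 at inducer concentrations of 0, 10, 50, 100 and 500 (arbitrary units), the function returns 50.0 under the default cutoff.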

234. The method of Paragraph 222, wherein growth inhibition is measured by
monitoring optical density of a culture medium.

235. The method of Paragraph 222, wherein said gene product is a polypeptide.

236. The method of Paragraph 235, wherein said polypeptide comprises a polypeptide
selected from the group consisting of a polypeptide having at least 25% amino acid identity as
determined using FASTA version 3.0t78 to a polypeptide selected from the group consisting of
SEQ ID NOs.: 42398-78581 and a polypeptide whose activity may be complemented by a
polypeptide selected from the group consisting of SEQ ID NOs: 42398-78581.

237. The method of Paragraph 222, wherein said gene product is an RNA.

238. The method of Paragraph 222, wherein said nucleic acid encoding said gene
product comprises a nucleic acid selected from the group consisting of a nucleic acid comprising a
nucleic acid having at least 70% nucleic acid identity as determined using BLASTN version 2.0
with the default parameters to a sequence selected from the group consisting of SEQ ID NOS.:
6214-42397, a nucleic acid which hybridizes to a sequence selected from the group consisting of
SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic acid which hybridizes to a
sequence selected from the group consisting of SEQ ID NOS.: 6214-42397 under moderate
conditions.

239. A compound identified using the method of Paragraph 222. 

240. A method for inhibiting cellular proliferation comprising introducing a compound 
with activity against a gene product or a compound with activity against a gene encoding said gene 
product into a population of cells expressing said gene product, wherein said gene product is 
selected from the group consisting of a gene product having at least 70% nucleotide sequence 
identity as determined using BLASTN version 2.0 with the default parameters to a gene product 
whose expression is inhibited by an antisense nucleic acid comprising a nucleotide sequence 
selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic 
acid having at least 70% nucleotide sequence identity as determined using BLASTN version 2.0 
with the default parameters to a nucleic acid encoding a gene product whose expression is inhibited 
by an antisense nucleic acid comprising a nucleotide sequence selected from the group consisting of 
SEQ ID NOs.: 1-6213, a gene product having at least 25% amino acid identity as determined using 
FASTA version 3.0t78 with the default parameters to a gene product whose expression is inhibited 
by an antisense nucleic acid comprising a nucleotide sequence selected from the group consisting of 
SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid comprising a nucleotide sequence 
which hybridizes to a nucleic acid selected from the group consisting of SEQ ID NOs.: 1-6213 
under stringent conditions, a gene product encoded by a nucleic acid comprising a nucleotide 
sequence which hybridizes to a nucleic acid selected from the group consisting of SEQ ID NOs.: 
1-6213 under moderate conditions, and a gene product whose activity may be complemented by the 
gene product whose activity is inhibited by a nucleic acid selected from the group consisting of 
SEQ ID NOs.: 1-6213. 

241. The method of Paragraph 240, wherein said compound is an antisense nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, or a 
proliferation-inhibiting portion thereof. 

242. The method of Paragraph 240, wherein said proliferation inhibiting portion of one 
of SEQ ID NOs.: 1-6213 is a fragment comprising at least 10, at least 20, at least 25, at least 30, at 
least 50 or more than 51 consecutive nucleotides of one of SEQ ID NOs.: 1-6213. 

243. The method of Paragraph 240, wherein said population is a population of Gram 
positive bacteria. 

244. The method of Paragraph 243, wherein said population of Gram positive bacteria is 
selected from the group consisting of a population of Staphylococcus species, Streptococcus 
species, Enterococcus species, Mycobacterium species, Clostridium species, and Bacillus species. 

245. The method of Paragraph 243, wherein said population is a population of 
Staphylococcus aureus. 

246. The method of Paragraph 245, wherein said population is a population of a 
bacterium selected from the group consisting of Staphylococcus aureus RN450 and Staphylococcus 
aureus RN4220. 

247. The method of Paragraph 240, wherein said population is a population of a 
bacterium selected from the group consisting of Acinetobacter baumannii, Anaplasma marginale, 
Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia 
burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter 
jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis, 
Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida 
pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, 
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, 
Coccidioides immitis, Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter 
cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, 
Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, 
Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis and any species falling within the genera of any of the above species. 

248. The method of Paragraph 240, wherein said population is a population of an 
organism other than E. coli. 

249. The method of Paragraph 240, wherein said product of said gene is from an 
organism other than E. coli. 

250. The method of Paragraph 240, wherein said gene product is selected from the 
group consisting of a polypeptide having at least 25% amino acid identity as determined using 
FASTA version 3.0t78 to a polypeptide selected from the group consisting of SEQ ID NOs.: 42398- 
78581 and a polypeptide whose activity may be complemented by a polypeptide selected from the 
group consisting of SEQ ID NOs.: 42398-78581. 

251. The method of Paragraph 240, wherein said gene comprises a nucleic acid selected 
from the group consisting of a nucleic acid comprising a nucleic acid having at least 70% 
nucleotide sequence identity as determined using BLASTN version 2.0 with the default parameters 
to a nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214-42397, a 
nucleic acid comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected 
from the group consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic 
acid comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected from the 
group consisting of SEQ ID NOS.: 6214-42397 under moderate conditions. 

252. A preparation comprising an effective concentration of an antisense nucleic acid in 
a pharmaceutically acceptable carrier wherein said antisense nucleic acid is selected from the group 
consisting of a nucleic acid comprising a sequence having at least 70% nucleotide sequence identity 
as determined using BLASTN version 2.0 with the default parameters to a nucleotide sequence 
selected from the group consisting of SEQ ID NOs.: 1-6213 or a proliferation-inhibiting portion 
thereof, a nucleic acid comprising a nucleotide sequence which hybridizes to a nucleic acid selected 
from the group consisting of SEQ ID NOs.: 1-6213 under stringent conditions, and a nucleic acid 
comprising a nucleotide sequence which hybridizes to a nucleic acid selected from the group 
consisting of SEQ ID NOs.: 1-6213 under moderate conditions. 

253. The preparation of Paragraph 252, wherein said proliferation-inhibiting portion of 
one of SEQ ID NOs.: 1-6213 comprises at least 10, at least 20, at least 25, at least 30, at least 50 or 
more than 50 consecutive nucleotides of one of SEQ ID NOs.: 1-6213. 
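
By way of illustration only, the candidate portions of a given length recited above can be enumerated as every run of consecutive nucleotides within a sequence. The following Python sketch uses a hypothetical helper name and a made-up toy sequence; it only lists fragments and does not predict whether any fragment is proliferation-inhibiting.

```python
def consecutive_fragments(sequence, length):
    """Return every fragment of `length` consecutive nucleotides in `sequence`.

    Fragments are listed in order of their start position. An empty list is
    returned when the sequence is shorter than the requested length.
    """
    if length <= 0:
        raise ValueError("length must be a positive integer")
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


# Toy example (not a sequence from the Sequence Listing).
fragments = consecutive_fragments("ACGTAC", 4)
```

A sequence of N nucleotides yields N - L + 1 fragments of length L, so the toy call above yields three fragments.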

254. A method for inhibiting the activity or expression of a gene in an operon which 
encodes a gene product required for proliferation comprising contacting a cell in a cell population 
with an antisense nucleic acid comprising at least a proliferation-inhibiting portion of said operon in 
an antisense orientation, wherein said gene product is selected from the group consisting of a gene 
product having at least 70% nucleotide sequence identity as determined using BLASTN version 2.0 
with the default parameters to a gene product whose expression is inhibited by an antisense nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, 
a gene product encoded by a nucleic acid having at least 70% nucleotide sequence identity as 
determined using BLASTN version 2.0 with the default parameters to a nucleic acid encoding a 
gene product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product having at least 
25% amino acid identity as determined using FASTA version 3.0t78 with the default parameters to 
a gene product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a 
nucleic acid comprising a nucleotide sequence which hybridizes to a nucleic acid selected from the 
group consisting of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by a 
nucleic acid comprising a nucleotide sequence which hybridizes to a nucleic acid selected from the 
group consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product whose 
activity may be complemented by the gene product whose activity is inhibited by a nucleic acid 
selected from the group consisting of SEQ ID NOs.: 1-6213. 
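
By way of illustration only, a sequence placed in an antisense orientation relative to a sense strand can be pictured as the reverse complement of that strand. The following Python sketch assumes a DNA alphabet of A, C, G and T; the function name and the toy input are hypothetical and do not come from the Sequence Listing.

```python
# Translation table mapping each DNA base to its Watson-Crick complement.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def antisense_dna(sense):
    """Return the reverse complement of a sense-strand DNA sequence.

    Characters outside A, C, G and T (either case) are left unchanged.
    """
    return sense.translate(_COMPLEMENT)[::-1]
```

Applying the function twice returns the original sequence, which is a convenient sanity check when preparing constructs in silico.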

255. The method of Paragraph 254, wherein said antisense nucleic acid comprises a 
nucleotide sequence having at least 70% nucleotide sequence identity as determined using 
BLASTN version 2.0 with the default parameters to a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213, a proliferation inhibiting portion thereof, a nucleic acid 
comprising a nucleotide sequence which hybridizes to a nucleic acid selected from the group 
consisting of SEQ ID NOs.: 1-6213 under stringent conditions, and a nucleic acid comprising 
a nucleotide sequence which hybridizes to a nucleic acid selected from the group consisting of SEQ 
ID NOs.: 1-6213 under moderate conditions. 

256. The method of Paragraph 254, wherein said cell is selected from the group 
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus 
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida 
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida 
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida 
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis, 
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus 
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, 
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis and any species falling within the genera of any of the above species. 

257. The method of Paragraph 254, wherein said cell is not an E. coli cell. 

258. The method of Paragraph 254, wherein said gene is from an organism other than E. 
coli. 

259. The method of Paragraph 254, wherein said cell is contacted with said antisense 
nucleic acid by introducing a plasmid which transcribes said antisense nucleic acid into said cell 
population. 

260. The method of Paragraph 254, wherein said cell is contacted with said antisense 
nucleic acid by introducing a phage which transcribes said antisense nucleic acid into said cell 
population. 

261. The method of Paragraph 254, wherein said cell is contacted with said antisense 
nucleic acid by transcribing said antisense nucleic acid from the chromosome of cells in said cell 
population. 

262. The method of Paragraph 254, wherein said cell is contacted with said antisense 
nucleic acid by introducing a promoter adjacent to a chromosomal copy of said antisense nucleic 
acid such that said promoter directs the synthesis of said antisense nucleic acid. 

263. The method of Paragraph 254, wherein said cell is contacted with said antisense 
nucleic acid by introducing a retron which expresses said antisense nucleic acid into said cell 
population. 

264. The method of Paragraph 254, wherein said cell is contacted with said antisense 
nucleic acid by introducing a ribozyme into said cell population, wherein a binding portion of said 
ribozyme is complementary to said antisense oligonucleotide. 

265. The method of Paragraph 254, wherein said cell is contacted with said antisense 
nucleic acid by introducing a liposome comprising said antisense oligonucleotide into said cell. 

266. The method of Paragraph 254, wherein said cell is contacted with said antisense 
nucleic acid by electroporation of said antisense nucleic acid into said cell. 

267. The method of Paragraph 254, wherein said antisense nucleic acid has at least 70% 
nucleotide sequence identity as determined using BLASTN version 2.0 with the default parameters 
to a nucleotide sequence comprising at least 10, at least 20, at least 25, at least 30, at least 50 or 
more than 50 consecutive nucleotides of one of SEQ ID NOs.: 1-6213. 

268. The method of Paragraph 254 wherein said antisense nucleic acid is a synthetic 
oligonucleotide. 

269. The method of Paragraph 254, wherein said gene comprises a nucleic acid selected 
from the group consisting of a nucleic acid comprising a nucleic acid having at least 70% 
nucleotide sequence identity as determined using BLASTN version 2.0 with the default parameters 
to a nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214-42397, a 
nucleic acid comprising a nucleotide sequence which hybridizes to a sequence selected from the 
group consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic acid 
comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected from the 
group consisting of SEQ ID NOS.: 6214-42397 under moderate conditions. 

270. A method for identifying a gene which is required for proliferation of a cell 
comprising: 

(a) contacting a cell with an antisense nucleic acid selected from the group 
consisting of a nucleic acid having at least 70% nucleotide sequence identity as determined using 
BLASTN version 2.0 with the default parameters to a nucleotide sequence selected from 
the group consisting of SEQ ID NOs.: 1-6213 or a proliferation-inhibiting portion thereof, a 
nucleic acid comprising a nucleotide sequence which hybridizes to a nucleic acid selected 
from the group consisting of SEQ ID NOs.: 1-6213 under stringent conditions, and a 
nucleic acid comprising a nucleotide sequence which hybridizes to a nucleic acid selected 
from the group consisting of SEQ ID NOs.: 1-6213 under moderate conditions, wherein 
said cell is a cell other than the organism from which said nucleic acid was obtained; 

(b) determining whether said nucleic acid inhibits proliferation of said cell; and 

(c) identifying the gene in said cell which encodes the mRNA which is 
complementary to said antisense nucleic acid or a portion thereof. 

271. The method of Paragraph 270, wherein said cell is selected from the group 
consisting of Staphylococcus species, Streptococcus species, Enterococcus species, Mycobacterium 
species, Clostridium species, and Bacillus species. 

272. The method of Paragraph 270 wherein said cell is selected from the group 
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus 
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida 
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida 
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida 
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis, 
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus 
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, 
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis and any species falling within the genera of any of the above species. 

273. The method of Paragraph 270, wherein said cell is not E. coli. 

274. The method of Paragraph 270, further comprising operably linking said antisense 
nucleic acid to a promoter which is functional in said cell, said promoter being included in a vector, 
and introducing said vector into said cell. 

275. A method for identifying a compound having the ability to inhibit proliferation of a 
cell comprising: 

(a) identifying a homolog of a gene or gene product whose activity or level is 
inhibited by an antisense nucleic acid in a test cell, wherein said test cell is not the 
microorganism from which the antisense nucleic acid was obtained, wherein said antisense 
nucleic acid is selected from the group consisting of a nucleic acid having at least 70% 
nucleotide sequence identity as determined using BLASTN version 2.0 with the default 
parameters to a nucleotide sequence selected from the group consisting of SEQ ID NOs. 1- 
6213, a nucleic acid comprising a nucleotide sequence which hybridizes to a nucleic acid 
selected from the group consisting of SEQ ID NOs.: 1-6213 under stringent conditions, and 
a nucleic acid comprising a nucleotide sequence which hybridizes to a nucleic acid selected 
from the group consisting of SEQ ID NOs.: 1-6213 under moderate conditions; 

(b) identifying an inhibitory nucleic acid sequence which inhibits the activity of 
said homolog in said test cell; 

(c) contacting said test cell with a sublethal level of said inhibitory nucleic acid, 
thus sensitizing said cell; 

(d) contacting the sensitized cell of step (c) with a compound; and 

(e) determining the degree to which said compound inhibits proliferation of said 
sensitized cell relative to a cell which does not express said inhibitory nucleic acid. 

276. The method of Paragraph 275, wherein said determining step comprises 
determining whether said compound inhibits proliferation of said sensitized test cell to a greater 
extent than said compound inhibits proliferation of a nonsensitized test cell. 
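
By way of illustration only, the comparison between a sensitized and a nonsensitized test cell can be sketched as a comparison of fractional growth inhibition in the two cultures. The following Python sketch assumes that growth is read as optical density with and without the compound; the function names and the use of a simple fractional inhibition are assumptions made for this example, not a procedure taken from the specification.

```python
def fractional_inhibition(od_with_compound, od_without_compound):
    """Return the fraction of growth lost when the compound is present."""
    if od_without_compound <= 0:
        raise ValueError("culture without compound shows no growth")
    return 1.0 - od_with_compound / od_without_compound


def more_inhibited_when_sensitized(sens_with, sens_without,
                                   nonsens_with, nonsens_without):
    """Return True if the compound inhibits the sensitized cell to a greater
    extent than it inhibits the nonsensitized cell.

    Each culture is compared with its own no-compound control, so the growth
    defect caused by the sublethal antisense level itself is factored out.
    """
    sensitized = fractional_inhibition(sens_with, sens_without)
    nonsensitized = fractional_inhibition(nonsens_with, nonsens_without)
    return sensitized > nonsensitized
```

Normalizing each culture to its own control is one design choice among several; in practice a dose-response comparison across compound concentrations may be more informative than a single pair of readings.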

277. The method of Paragraph 275, wherein step (a) comprises identifying a 
homologous nucleic acid to a gene or gene product whose activity or level is inhibited by a nucleic 
acid having at least 70% nucleotide sequence identity as determined using BLASTN version 2.0 
with the default parameters to a nucleotide sequence selected from the group consisting of SEQ ID 
NOs. 1-6213 or a nucleic acid encoding a homologous polypeptide to a polypeptide whose activity 
or level is inhibited by a nucleic acid having at least 70% nucleotide sequence identity as 
determined using BLASTN version 2.0 with the default parameters to a nucleotide sequence 
selected from the group consisting of SEQ ID NOs. 1-6213 by using an algorithm selected from the 
group consisting of BLASTN version 2.0 with the default parameters and FASTA version 3.0t78 
algorithm with the default parameters to identify said homologous nucleic acid or said nucleic acid 
encoding a homologous polypeptide in a database. 

278. The method of Paragraph 275 wherein said step (a) comprises identifying a 
homologous nucleic acid or a nucleic acid encoding a homologous polypeptide by identifying 
nucleic acids comprising nucleotide sequences which hybridize to said nucleic acid having at least 
70% nucleotide sequence identity as determined using BLASTN version 2.0 with the default 
parameters to a nucleotide sequence selected from the group consisting of SEQ ID NOs. 1-6213 or 
the complement of the nucleotide sequence of said nucleic acid selected from the group consisting 
of SEQ ID NOs. 1-6213. 

279. The method of Paragraph 275 wherein step (a) comprises expressing a nucleic acid 
having at least 70% nucleic acid identity as determined using BLASTN version 2.0 with the default 
parameters to a sequence selected from the group consisting of SEQ ID NOs. 1-6213 in said test 
cell. 

280. The method of Paragraph 275, wherein step (a) comprises identifying a 
homologous nucleic acid or a nucleic acid encoding a homologous polypeptide in a test cell 
selected from the group consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus 
fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, 
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, 
Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida 
parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida 
pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, 
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, 
Coccidioides immitis, Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter 
cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, 
Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, 
Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis and any species falling within the genera of any of the above species. 

281. The method of Paragraph 275, wherein step (a) comprises identifying a 
homologous nucleic acid or a nucleic acid encoding a homologous polypeptide in a test cell other 
than E. coli. 

282. The method of Paragraph 275, wherein said inhibitory nucleic acid is an antisense 
nucleic acid. 






WO 02/077183 PCT/US02/09107 

283. The method of Paragraph 275, wherein said inhibitory nucleic acid comprises an 
antisense nucleic acid to a portion of said homolog. 

284. The method of Paragraph 275, wherein said inhibitory nucleic acid comprises an 
antisense nucleic acid to a portion of the operon encoding said homolog. 

285. The method of Paragraph 275, wherein the step of contacting the cell with a 
sublethal level of said inhibitory nucleic acid comprises directly contacting said cell with said 
inhibitory nucleic acid. 

286. The method of Paragraph 275, wherein the step of contacting the cell with a 
sublethal level of said inhibitory nucleic acid comprises expressing an antisense nucleic acid to said 
homolog in said cell. 

287. The method of Paragraph 275, wherein said gene product comprises a polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID NOs.: 42398- 
78581. 

288. The method of Paragraph 275, wherein said gene comprises a nucleic acid selected 
from the group consisting of a nucleic acid comprising a nucleic acid having at least 70% 
nucleotide sequence identity as determined using BLASTN version 2.0 with the default parameters 
to a sequence selected from the group consisting of SEQ ID NOS.: 6214-42397, a nucleic acid 
comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected from the 
group consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic acid 
comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected from the 
group consisting of SEQ ID NOS.: 6214-42397 under moderate conditions. 

289. A compound identified using the method of Paragraph 275. 

290. A method of identifying a compound having the ability to inhibit proliferation 
comprising: 

(a) sensitizing a test cell by contacting said test cell with a sublethal level of an 
antisense nucleic acid, wherein said antisense nucleic acid is selected from the group 
consisting of a nucleic acid having at least 70% nucleotide sequence identity as determined 
using BLASTN version 2.0 with the default parameters to a nucleotide sequence selected 
from the group consisting of SEQ ID NOs. 1-6213 or a portion thereof which inhibits the 
proliferation of the cell from which said nucleic acid was obtained, a nucleic acid 
comprising a nucleotide sequence which hybridizes to a nucleic acid selected from the 
group consisting of SEQ ID NOs.: 1-6213 under stringent conditions, and a nucleic acid 
comprising a nucleotide sequence which hybridizes to a nucleic acid selected from the 
group consisting of SEQ ID NOs.: 1-6213 under moderate conditions; 

(b) contacting the sensitized test cell of step (a) with a compound; and 

(c) determining the degree to which said compound inhibits proliferation of said 
sensitized test cell relative to a cell which does not contain said antisense nucleic acid. 

291. The method of Paragraph 290, wherein said determining step comprises 
determining whether said compound inhibits proliferation of said sensitized test cell to a greater 
extent than said compound inhibits proliferation of a nonsensitized test cell. 

292. A compound identified using the method of Paragraph 290. 

293. The method of Paragraph 290, wherein said test cell is selected from the group 
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus 
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida 
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida 
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida 
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis, 
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus 
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, 
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis and any species falling within the genera of any of the above species. 

294. The method of Paragraph 290, wherein the test cell is not E. coli. 

295. A method for identifying a compound having activity against a biological pathway 
required for proliferation comprising: 

(a) sensitizing a cell by providing a sublethal level of an antisense nucleic acid 
complementary to a nucleic acid encoding a gene product required for proliferation, 
wherein said gene product is selected from the group consisting of a gene product having at 
least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the 
default parameters to a gene product whose expression is inhibited by an antisense nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 
1-6213, a gene product encoded by a nucleic acid having at least 70% nucleotide sequence 
identity as determined using BLASTN version 2.0 with the default parameters to a nucleic 
acid encoding a gene product whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1- 
6213, a gene product having at least 25% amino acid identity as determined using FASTA 
version 3.0t78 with the default parameters to a gene product whose expression is inhibited 
by an antisense nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid comprising a 
nucleotide sequence which hybridizes to a nucleic acid selected from the group consisting 
of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by a nucleic 
acid comprising a nucleotide sequence which hybridizes to a nucleic acid selected from the 
group consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product 
whose activity may be complemented by the gene product whose activity is inhibited by a 
nucleic acid selected from the group consisting of SEQ ID NOs.: 1-6213; 

(b) contacting the sensitized cell with a compound; and 

(c) determining the extent to which said compound inhibits the growth of said 
sensitized cell relative to a cell which does not contain said antisense nucleic acid. 

296. The method of Paragraph 295, wherein said determining step comprises
determining whether said compound inhibits the growth of said sensitized cell to a greater extent
than said compound inhibits the growth of a nonsensitized cell.
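
The comparison recited in Paragraphs 295(c) and 296 can be illustrated with a minimal sketch. It assumes that growth is read as the endpoint optical density of a liquid culture and that percent inhibition is the metric; the function names and the readings are hypothetical and are not taken from the disclosure.

```python
def percent_inhibition(od_with_compound: float, od_without_compound: float) -> float:
    """Percent growth inhibition from two endpoint optical density readings."""
    if od_without_compound <= 0:
        raise ValueError("OD of the untreated culture must be positive")
    return 100.0 * (1.0 - od_with_compound / od_without_compound)


def inhibits_sensitized_more(sensitized: tuple, nonsensitized: tuple) -> bool:
    """Each argument is (OD with compound, OD without compound) for one cell type.

    Returns True when the compound inhibits the sensitized cell to a greater
    extent than it inhibits the nonsensitized cell.
    """
    return percent_inhibition(*sensitized) > percent_inhibition(*nonsensitized)


# Hypothetical readings, for illustration only.
sensitized_cell = (0.20, 0.80)     # antisense expressed at a sublethal level
nonsensitized_cell = (0.72, 0.90)  # no antisense nucleic acid
print(inhibits_sensitized_more(sensitized_cell, nonsensitized_cell))
```

Normalizing each culture to its own untreated reading keeps the growth defect caused by the antisense nucleic acid itself from being counted as an effect of the compound.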

297. The method of Paragraph 295, wherein said cell is selected from the group
consisting of bacterial cells, fungal cells, plant cells, and animal cells.

298. The method of Paragraph 295, wherein said cell is a Gram positive bacterium.

299. The method of Paragraph 298, wherein said Gram positive bacterium is selected
from the group consisting of Staphylococcus species, Streptococcus species, Enterococcus species,
Mycobacterium species, Clostridium species, and Bacillus species.

300. The method of Paragraph 299, wherein said Gram positive bacterium is
Staphylococcus aureus.

301. The method of Paragraph 298, wherein said Gram positive bacterium is selected
from the group consisting of Staphylococcus aureus RN450 and Staphylococcus aureus RN4220.

302. The method of Paragraph 295, wherein said cell is selected from the group
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diphtheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella choleraesuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

303. The method of Paragraph 295, wherein said cell is not an E. coli cell. 

304. The method of Paragraph 295, wherein said gene product is from an organism other 
than E. coli. 

305. The method of Paragraph 295, wherein said antisense nucleic acid is transcribed
from an inducible promoter.

306. The method of Paragraph 305, further comprising contacting the cell with an agent
which induces expression of said antisense nucleic acid from said inducible promoter, wherein said
antisense nucleic acid is expressed at a sublethal level.

307. The method of Paragraph 295, wherein inhibition of proliferation is measured by
monitoring the optical density of a liquid culture.

308. The method of Paragraph 295, wherein said gene product comprises a polypeptide
having at least 25% amino acid identity as determined using FASTA version 3.0t78 with the default
parameters to a sequence selected from the group consisting of SEQ ID NOs.: 42398-78581.

309. The method of Paragraph 295, wherein said nucleic acid encoding said gene
product comprises a nucleic acid selected from the group consisting of a nucleic acid comprising a
nucleic acid having at least 70% nucleotide sequence identity as determined using BLASTN
version 2.0 with the default parameters to a nucleotide sequence selected from the group consisting
of SEQ ID NOs.: 6214-42397, a nucleic acid comprising a nucleotide sequence which hybridizes to
a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 6214-42397 under
stringent conditions, and a nucleic acid comprising a nucleotide sequence which hybridizes to a
nucleotide sequence selected from the group consisting of SEQ ID NOs.: 6214-42397 under
moderate conditions.
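
The identity thresholds recited above (at least 70% for nucleotide sequences, at least 25% for amino acid sequences) can be made concrete with a short sketch. It assumes the two sequences have already been aligned, with "-" marking gaps, and it counts identical residues over aligned columns; this is a simplification and does not reproduce how BLASTN version 2.0 or FASTA version 3.0t78 score or report identity. The function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding the same residue in both sequences.

    Assumes both strings come from one pairwise alignment (equal length,
    '-' for gaps). Columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment has no residues")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str, threshold: float = 70.0) -> bool:
    """True when the aligned pair reaches the threshold (70.0 for the nucleotide case)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Toy sequences, for illustration only.
print(meets_identity_threshold("ACGTACGTAC", "ACGTTCGTAC"))
```

For the amino acid case the same function would be called with `threshold=25.0`.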

310. A compound identified using the method of Paragraph 295. 

311. A method for identifying a compound having the ability to inhibit cellular
proliferation comprising:

(a) contacting a cell with an agent which reduces the activity or level of a gene
product required for proliferation of said cell, wherein said gene product is selected from
the group consisting of a gene product having at least 70% nucleotide sequence identity as
determined using BLASTN version 2.0 with the default parameters to a gene product
whose expression is inhibited by an antisense nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product
encoded by a nucleic acid having at least 70% nucleotide sequence identity as determined
using BLASTN version 2.0 with the default parameters to a nucleic acid encoding a gene
product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product having
at least 25% amino acid identity as determined using FASTA version 3.0t78 with the
default parameters to a gene product whose expression is inhibited by an antisense nucleic
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.:
1-6213, a gene product encoded by a nucleic acid comprising a nucleotide sequence which
hybridizes to a nucleic acid selected from the group consisting of SEQ ID NOs.: 1-6213
under stringent conditions, a gene product encoded by a nucleic acid comprising a
nucleotide sequence which hybridizes to a nucleic acid selected from the group consisting
of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product whose activity
may be complemented by the gene product whose activity is inhibited by a nucleic acid
selected from the group consisting of SEQ ID NOs.: 1-6213;

(b) contacting said cell with a compound; and

(c) determining the degree to which said compound reduces proliferation of said
contacted cell relative to a cell which was not contacted with said agent.

312. The method of Paragraph 311, wherein said determining step comprises
determining whether said compound reduces proliferation of said contacted cell to a greater extent
than said compound reduces proliferation of cells which have not been contacted with said agent.

313. The method of Paragraph 311, wherein said cell is selected from the group
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diphtheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella choleraesuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

314. The method of Paragraph 311, wherein said cell is not an E. coli cell.

315. The method of Paragraph 311, wherein said gene product is from an organism other
than E. coli.

316. The method of Paragraph 311, wherein said agent which reduces the activity or
level of a gene product required for proliferation of said cell comprises an antisense nucleic acid to
a gene or operon required for proliferation.

317. The method of Paragraph 311, wherein said agent which reduces the activity or
level of a gene product required for proliferation of said cell comprises a compound known to
inhibit growth or proliferation of a cell.

318. The method of Paragraph 311, wherein said cell contains a mutation which reduces
the activity or level of said gene product required for proliferation of said cell.

319. The method of Paragraph 311, wherein said mutation is a temperature sensitive
mutation.

320. The method of Paragraph 311, wherein said gene product comprises a polypeptide
having at least 25% amino acid identity as determined using FASTA
version 3.0t78 with the default parameters to an amino acid sequence selected from the group
consisting of SEQ ID NOs.: 42398-78581.

321. A compound identified using the method of Paragraph 311. 

322. A method for identifying the biological pathway in which a proliferation-required 
gene product or a gene encoding a proliferation-required gene product lies comprising: 

(a) providing a sublethal level of an antisense nucleic acid which inhibits the
activity or reduces the level of said gene encoding a proliferation-required gene product or
said proliferation-required gene product in a test cell, wherein said proliferation-required
gene product is selected from the group consisting of a gene product having at
least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the
default parameters to a gene product whose expression is inhibited by an antisense nucleic
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.:
1-6213, a gene product encoded by a nucleic acid having at least 70% nucleotide sequence
identity as determined using BLASTN version 2.0 with the default parameters to a nucleic
acid encoding a gene product whose expression is inhibited by an antisense nucleic acid
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.:
1-6213, a gene product having at least 25% amino acid identity as determined using FASTA
version 3.0t78 with the default parameters to a gene product whose expression is inhibited
by an antisense nucleic acid comprising a nucleotide sequence selected from the group
consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid comprising a
nucleotide sequence which hybridizes to a nucleic acid selected from the group consisting
of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by a nucleic
acid comprising a nucleotide sequence which hybridizes to a nucleic acid selected from the
group consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product
whose activity may be complemented by the gene product whose activity is inhibited by a
nucleic acid selected from the group consisting of SEQ ID NOs.: 1-6213;

(b) contacting said test cell with a compound known to inhibit growth or
proliferation of a cell, wherein the biological pathway on which said compound acts is
known; and

(c) determining the degree to which said compound inhibits proliferation of said
test cell relative to a cell which does not contain said antisense nucleic acid.

323. The method of Paragraph 322, wherein said determining step comprises
determining whether said test cell has a substantially greater sensitivity to said compound than a
cell which does not express said sublethal level of said antisense nucleic acid.

324. The method of Paragraph 322, wherein said gene product comprises a polypeptide
having at least 25% amino acid identity as determined using FASTA version 3.0t78 with the default
parameters to an amino acid sequence selected from the group consisting of SEQ ID NOs.:
42398-78581.

325. The method of Paragraph 322, wherein said test cell is selected from the group
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diphtheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella choleraesuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

326. The method of Paragraph 322, wherein said test cell is not an E. coli cell.

327. The method of Paragraph 322, wherein said gene product is from an organism other
than E. coli.

328. A method for determining the biological pathway on which a test compound acts 
comprising: 

(a) providing a sublethal level of an antisense nucleic acid complementary to a
proliferation-required nucleic acid in a cell, thereby producing a sensitized cell, wherein
said antisense nucleic acid is selected from the group consisting of a nucleic acid having at
least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the
default parameters to a nucleotide sequence selected from the group consisting of SEQ ID
NOs.: 1-6213 or a proliferation-inhibiting portion thereof, a nucleic acid comprising a
nucleotide sequence which hybridizes to a nucleic acid selected from the group consisting
of SEQ ID NOs.: 1-6213 under stringent conditions, and a nucleic acid comprising a
nucleotide sequence which hybridizes to a nucleic acid selected from the group consisting
of SEQ ID NOs.: 1-6213 under moderate conditions and wherein the biological pathway in
which said proliferation-required nucleic acid or a protein encoded by said
proliferation-required polypeptide lies is known,

(b) contacting said cell with said test compound; and

(c) determining the degree to which said compound inhibits proliferation of said
sensitized cell relative to a cell which does not contain said antisense nucleic acid.

329. The method of Paragraph 328, wherein said determining step comprises
determining whether said sensitized cell has a substantially greater sensitivity to said test compound
than a cell which does not express said sublethal level of said antisense nucleic acid.

330. The method of Paragraph 328, further comprising:

(d) providing a sublethal level of a second antisense nucleic acid complementary to
a second proliferation-required nucleic acid in a second cell, wherein said second
proliferation-required nucleic acid is in a different biological pathway than said
proliferation-required nucleic acid in step (a); and

(e) determining whether said second cell does not have a substantially greater
sensitivity to said test compound than a cell which does not express said sublethal level of
said second antisense nucleic acid, wherein said test compound is specific for the biological
pathway against which the antisense nucleic acid of step (a) acts if said sensitized cell has
substantially greater sensitivity to said test compound than said second cell.
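
The specificity test of Paragraph 330 can be sketched as a comparison of two sensitization ratios. The sketch assumes sensitivity is summarized as an IC50 (the compound concentration giving half-maximal growth inhibition) and that "substantially greater sensitivity" means a fold shift at or above a cutoff; the text does not fix that cutoff, so the 2-fold default, the function names, and the example values are hypothetical.

```python
def fold_sensitization(ic50_control: float, ic50_antisense: float) -> float:
    """How many times more sensitive the antisense-expressing cell is than its control."""
    if ic50_control <= 0 or ic50_antisense <= 0:
        raise ValueError("IC50 values must be positive")
    return ic50_control / ic50_antisense


def is_pathway_specific(fold_first: float, fold_second: float, cutoff: float = 2.0) -> bool:
    """True when only the step (a) cell shows a shift at or above the assumed cutoff.

    fold_first:  shift for the cell sensitized against the pathway of step (a)
    fold_second: shift for the second cell, sensitized against a different pathway
    """
    return fold_first >= cutoff and fold_second < cutoff


# Hypothetical IC50 values in arbitrary concentration units.
first = fold_sensitization(ic50_control=8.0, ic50_antisense=1.0)
second = fold_sensitization(ic50_control=8.0, ic50_antisense=7.0)
print(is_pathway_specific(first, second))
```

A compound that shifts both cells by a similar factor would return False here, which corresponds to a general rather than pathway-specific effect.
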

331. The method of Paragraph 328, wherein said sensitized cell is selected from the group
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diphtheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella choleraesuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

332. The method of Paragraph 328, wherein said sensitized cell is not an E. coli cell.

333. The method of Paragraph 328, wherein said proliferation-required nucleic acid is 
from an organism other than E. coli. 

334. A compound which inhibits proliferation by interacting with a gene encoding a
gene product required for proliferation or with a gene product required for proliferation, wherein
said gene product is selected from the group consisting of a gene product having at least 70%
nucleotide sequence identity as determined using BLASTN version 2.0 with the default parameters
to a gene product whose expression is inhibited by an antisense nucleic acid comprising a
nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product
encoded by a nucleic acid having at least 70% nucleotide sequence identity as determined using
BLASTN version 2.0 with the default parameters to a nucleic acid encoding a gene product whose
expression is inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from
the group consisting of SEQ ID NOs.: 1-6213, a gene product having at least 25% amino acid
identity as determined using FASTA version 3.0t78 with the default parameters to a gene product
whose expression is inhibited by an antisense nucleic acid comprising a nucleotide sequence
selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic
acid comprising a nucleotide sequence which hybridizes to a nucleic acid selected from the group
consisting of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by a nucleic
acid comprising a nucleotide sequence which hybridizes to a nucleic acid selected from the group
consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product whose activity
may be complemented by the gene product whose activity is inhibited by a nucleic acid selected
from the group consisting of SEQ ID NOs.: 1-6213.

335. The compound of Paragraph 334, wherein said gene product comprises a
polypeptide having at least 25% amino acid identity as determined using FASTA version 3.0t78
with the default parameters to a sequence selected from the group consisting of SEQ ID NOs.:
42398-78581.

336. The compound of Paragraph 334, wherein said gene comprises a nucleic acid
selected from the group consisting of a nucleic acid comprising a nucleic acid having at least 70%
nucleotide sequence identity as determined using BLASTN version 2.0 with the default parameters
to a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 6214-42397, a
nucleic acid comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected
from the group consisting of SEQ ID NOs.: 6214-42397 under stringent conditions, and a nucleic
acid comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected from the
group consisting of SEQ ID NOs.: 6214-42397 under moderate conditions.

337. A method for manufacturing an antibiotic comprising the steps of:

screening one or more candidate compounds to identify a compound that reduces the
activity or level of a gene product required for proliferation wherein said gene product is selected
from the group consisting of a gene product having at least 70% nucleotide sequence identity as
determined using BLASTN version 2.0 with the default parameters to a gene product whose
expression is inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from
the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid having at
least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the default
parameters to a nucleic acid encoding a gene product whose expression is inhibited by an antisense
nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ ID
NOs.: 1-6213, a gene product having at least 25% amino acid identity as determined using FASTA
version 3.0t78 with the default parameters to a gene product whose expression is inhibited by an
antisense nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ
ID NOs.: 1-6213, a gene product encoded by a nucleic acid comprising a nucleotide sequence
which hybridizes to a nucleic acid selected from the group consisting of SEQ ID NOs.: 1-6213
under stringent conditions, a gene product encoded by a nucleic acid comprising a nucleotide
sequence which hybridizes to a nucleic acid selected from the group consisting of SEQ ID NOs.:
1-6213 under moderate conditions, and a gene product whose activity may be complemented by the
gene product whose activity is inhibited by a nucleic acid selected from the group consisting of
SEQ ID NOs.: 1-6213; and

manufacturing the compound so identified.

338. The method of Paragraph 337, wherein said screening step comprises performing
any one of the methods of Paragraphs 205, 211, 222, 275, 290, 295, 311.

339. The method of Paragraph 337, wherein said gene product comprises a polypeptide
having at least 25% amino acid identity as determined using FASTA version 3.0t78 with the default
parameters to an amino acid sequence selected from the group consisting of SEQ ID NOs.:
42398-78581.

340. A method for inhibiting proliferation of a cell in a subject comprising administering
an effective amount of a compound that reduces the activity or level of a gene product required for
proliferation of said cell, wherein said gene product is selected from the group consisting of a gene
product having at least 70% nucleotide sequence identity as determined using BLASTN version 2.0
with the default parameters to a gene product whose expression is inhibited by an antisense nucleic
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213,
a gene product encoded by a nucleic acid having at least 70% nucleotide sequence identity as
determined using BLASTN version 2.0 with the default parameters to a nucleic acid encoding a
gene product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product having at least
25% amino acid identity as determined using FASTA version 3.0t78 with the default parameters to
a gene product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a
nucleic acid comprising a nucleotide sequence which hybridizes to a nucleic acid selected from the
group consisting of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by a
nucleic acid comprising a nucleotide sequence which hybridizes to a nucleic acid selected from the
group consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product whose
activity may be complemented by the gene product whose activity is inhibited by a nucleic acid
selected from the group consisting of SEQ ID NOs.: 1-6213.

341. The method of Paragraph 340, wherein said subject is selected from the group
consisting of vertebrates, mammals, avians, and human beings.

342. The method of Paragraph 340, wherein said gene product comprises a polypeptide
having at least 25% amino acid identity as determined using FASTA version 3.0t78 with the default
parameters to an amino acid sequence selected from the group consisting of SEQ ID NOs.:
42398-78581.

343. The method of Paragraph 340, wherein said cell is selected from the group
consisting of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diphtheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella choleraesuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia enterocolitica,
Yersinia pestis and any species falling within the genera of any of the above species.

344. The method of Paragraph 340, wherein said cell is not E. coli.

345. The method of Paragraph 340, wherein said gene product is from an organism other
than E. coli.

346. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain in said 
culture overexpresses a different gene product which is essential for proliferation of said
organism, wherein said culture comprises a strain in which a gene product whose activity or 
level is inhibited by a nucleic acid comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NOs.: 1-6213 is overexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit 
the proliferation of strains of said organism which do not overexpress said gene product on
which said compound acts, such that strains which overexpress said gene product on which 
said compound acts proliferate more rapidly than strains which do not overexpress said 
gene product on which said compound acts; and 

identifying the gene product which is overexpressed in a strain which proliferated 
more rapidly in said culture.
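The identification step of Paragraph 346 can be illustrated with a minimal sketch. It assumes that the relative abundance of each strain in the pooled culture has been measured before and after contact with the compound (the measurement technique is not specified here), and it ranks strains by fold change in their share of the culture. The function name, the input format, and the two-fold cutoff are hypothetical choices made for illustration only; they are not part of the disclosure.

```python
from typing import Dict, List, Tuple


def rank_enriched_strains(
    before: Dict[str, float],
    after: Dict[str, float],
    min_fold: float = 2.0,  # hypothetical cutoff, chosen only for illustration
) -> List[Tuple[str, float]]:
    """Return (gene_product, fold_change) pairs at or above min_fold, highest first.

    `before` and `after` map the overexpressed gene product of each strain to
    that strain's measured abundance before and after compound exposure.
    """
    total_before = sum(before.values())
    total_after = sum(after.values())
    if total_before <= 0 or total_after <= 0:
        raise ValueError("abundance totals must be positive")

    ranked = []
    for gene, count_before in before.items():
        if count_before <= 0:
            continue  # a strain absent at the start cannot be scored
        share_before = count_before / total_before
        share_after = after.get(gene, 0.0) / total_after
        fold = share_after / share_before
        if fold >= min_fold:
            ranked.append((gene, fold))

    # Strains that proliferated more rapidly than the rest come first.
    ranked.sort(key=lambda item: item[1], reverse=True)
    return ranked


if __name__ == "__main__":
    before = {"geneA": 100, "geneB": 100, "geneC": 100, "geneD": 100}
    after = {"geneA": 10, "geneB": 370, "geneC": 10, "geneD": 10}
    for gene, fold in rank_enriched_strains(before, after):
        print(f"{gene}: {fold:.2f}-fold enriched")
```

In this toy input only the strain overexpressing the hypothetical "geneB" increases its share of the culture, so it is the single candidate reported.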

347. A method for identifying the gene product on which a compound which inhibits
proliferation of an organism acts comprising:

obtaining a culture comprising a plurality of strains wherein each strain in said
culture overexpresses a different gene product which is essential for proliferation of said 
organism, wherein said culture comprises a strain in which a gene product encoded by a 
nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ 
ID NOs.: 6214-42397 is overexpressed;

contacting said culture with a sufficient concentration of said compound to inhibit 
the proliferation of strains of said organism which do not overexpress said gene product on 
which said compound acts, such that strains which overexpress said gene product on which 
said compound acts proliferate more rapidly than strains which do not overexpress said 
gene product on which said compound acts; and

identifying the gene product which is overexpressed in a strain which proliferated 
more rapidly in said culture. 

348. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain in said
culture overexpresses a different gene product which is essential for proliferation of said 
organism, wherein said culture comprises a strain in which a gene product comprising an 
amino acid sequence selected from the group consisting of SEQ ID NOs.: 42938-78581 is 
overexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit
the proliferation of strains of said organism which do not overexpress said gene product on 
which said compound acts, such that strains which overexpress said gene product on which 
said compound acts proliferate more rapidly than strains which do not overexpress said 
gene product on which said compound acts; and 

identifying the gene product which is overexpressed in a strain which proliferated
more rapidly in said culture. 

349. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain in said 
culture overexpresses a different gene product which is essential for proliferation of said
organism, wherein said culture comprises a strain in which a gene product selected from the 
group consisting of a gene product having at least 70% nucleotide sequence identity as 
determined using BLASTN version 2.0 with the default parameters to a gene product 
whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product
encoded by a nucleic acid having at least 70% nucleotide sequence identity as determined 
using BLASTN version 2.0 with the default parameters to a nucleic acid encoding a gene 
product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs: 1-6213, a gene product
having at least 25% amino acid identity as determined using FASTA version 3.0t78 with 
the default parameters to a gene product whose expression is inhibited by an antisense 
nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ 
ID NOs.: 1-6213, a gene product encoded by a nucleic acid which hybridizes to a nucleic
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 
1-6213 under stringent conditions, a gene product encoded by a nucleic acid which 
hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product whose 
activity may be complemented by the gene product whose activity is inhibited by a nucleic
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs: 
1-6213 is overexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit 
the proliferation of strains of said organism which do not overexpress said gene product on 
which said compound acts, such that strains which overexpress said gene product on which
said compound acts proliferate more rapidly than strains which do not overexpress said 
gene product on which said compound acts; and 

identifying the gene product which is overexpressed in a strain which proliferated 
more rapidly in said culture.

350. A method for identifying the gene product on which a compound which inhibits
proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain in said 
culture overexpresses a different gene product which is essential for proliferation of said 
organism, wherein said culture comprises a strain in which a gene product encoded by a 
nucleic acid comprising a nucleotide sequence selected from the group consisting of a
nucleic acid comprising a nucleic acid having at least 70% nucleotide sequence identity as 
determined using BLASTN version 2.0 with the default parameters to a nucleotide 
sequence selected from the group consisting of SEQ ID NOS.: 6214-42397, a nucleic acid 
comprising a nucleotide sequence which hybridizes to a sequence selected from the group 
consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic acid
comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected from 
the group consisting of SEQ ID NOS.: 6214-42397 under moderate conditions is
overexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit 
the proliferation of strains of said organism which do not overexpress said gene product on
which said compound acts, such that strains which overexpress said gene product on which
said compound acts proliferate more rapidly than strains which do not overexpress said 
gene product on which said compound acts; and

identifying the gene product which is overexpressed in a strain which proliferated
more rapidly in said culture. 

351. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising:

obtaining a culture comprising a plurality of strains wherein each strain in said
culture overexpresses a different gene product which is essential for proliferation of said 
organism, wherein said culture comprises a strain in which a gene product comprises a 
polypeptide selected from the group consisting of a polypeptide having at least 25% amino 
acid identity as determined using FASTA version 3.0t78 to a polypeptide selected from the 
group consisting of SEQ ID NOs.: 42938-78581 and a polypeptide whose activity may be
complemented by a polypeptide selected from the group consisting of SEQ ID NOs: 42938- 
78581 is overexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit 
the proliferation of strains of said organism which do not overexpress said gene product on 
which said compound acts, such that strains which overexpress said gene product on which
said compound acts proliferate more rapidly than strains which do not overexpress said 
gene product on which said compound acts; and 

identifying the gene product which is overexpressed in a strain which proliferated 
more rapidly in said culture.

352. The method of Paragraph 346, 347, 348, 349, 350 or 351, wherein said culture
includes at least one strain which does not overexpress a gene product which is essential for
proliferation of said organism. 

353. The method of Paragraph 346, 347, 348, 349, 350 or 351, wherein said strains 
which overexpress said gene products comprise a nucleic acid encoding said gene product which is
essential for proliferation of said organism operably linked to a regulatable promoter.

354. The method of Paragraph 346, 347, 348, 349, 350 or 351, wherein said strains 
which overexpress said gene products comprise a nucleic acid encoding said gene product which is essential
for proliferation of said organism operably linked to a constitutive promoter. 

355. The method of Paragraph 346, 347, 348, 349, 350 or 351, wherein said 
identification step comprises determining the nucleotide sequence of a nucleic acid encoding said
gene product in said cell which proliferated more rapidly in said culture. 

356. The method of Paragraph 346, 347, 348, 349, 350 or 351, wherein said 
identification step comprises performing an amplification reaction to identify the nucleic acid 
encoding said gene product in said cell which proliferated more rapidly in said cell culture. 

357. The method of Paragraph 356, wherein the products of said amplification reaction
are labeled with a detectable dye. 

358. The method of Paragraph 346, 347, 348, 349, 350 or 351, wherein said 
identification step comprises performing a hybridization procedure.

359. The method of Paragraph 346, 347, 348, 349, 350 or 351, wherein said
identification step comprises contacting a nucleic acid array with a nucleic acid encoding said gene 
product in said cell which proliferated more rapidly in said cell culture. 

360. The method of Paragraph 346, 347, 348, 349, 350 or 351, wherein said organism is 
selected from the group consisting of bacteria, fungi, and protozoa.

361. The method of Paragraph 346, 347, 348, 349, 350 or 351, wherein said culture is a 
culture of an organism selected from the group consisting of Acinetobacter baumannii, Anaplasma
marginale, Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis,
Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei,
Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata),
Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr
(also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia
trachomatis, Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium
perfringens, Coccidioides immitis, Corynebacterium diphtheriae, Cryptococcus neoformans,
Enterobacter cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli,
Haemophilus influenzae, Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae,
Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium,
Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia
asteroides, Pasteurella haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus
mirabilis, Proteus vulgaris, Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas
syringae, Salmonella bongori, Salmonella choleraesuis, Salmonella enterica, Salmonella paratyphi,
Salmonella typhi, Salmonella typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri,
Shigella sonnei, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus,
Streptococcus pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum,
Ureaplasma urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia
enterocolitica, Yersinia pestis and any species falling within the genera of any of the above species.

362. The method of Paragraph 346, 347, 348, 349, 350 or 351, wherein said compound
is obtained from a library of natural compounds. 

363. The method of Paragraph 346, 347, 348, 349, 350 or 351, wherein said compound
is obtained from a library of synthetic compounds. 

364. The method of Paragraph 346, 347, 348, 349, 350 or 351, wherein said compound
is present in a crude or partially purified state. 

365. The method of Paragraph 346, 347, 348, 349, 350 or 351, further comprising
determining whether said gene product in said strain which proliferated more rapidly in said culture
has a counterpart in at least one other organism. 

366. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising:

obtaining an array of strains on a solid growth medium wherein each strain
overexpresses a different gene product which is essential for proliferation of said organism,
wherein said culture comprises a strain in which a gene product whose activity or level is 
inhibited by a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213 is overexpressed;

contacting said array of strains with a sufficient concentration of said compound to 
inhibit the proliferation of strains of said organism which do not overexpress said gene 
product on which said compound acts, such that strains which overexpress said gene 
product on which said compound acts proliferate more rapidly than strains which do not 
overexpress said gene product on which said compound acts; and

identifying the gene product which is overexpressed in a strain which proliferated 
more rapidly on said solid medium. 

367. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining an array of strains on a solid growth medium wherein each strain
overexpresses a different gene product which is essential for proliferation of said organism, 
wherein said culture comprises a strain in which a gene product encoded by a nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 
6214-42397 is overexpressed; 

contacting said array of strains with a sufficient concentration of said compound to
inhibit the proliferation of strains of said organism which do not overexpress said gene 
product on which said compound acts, such that strains which overexpress said gene 
product on which said compound acts proliferate more rapidly than strains which do not 
overexpress said gene product on which said compound acts; and 

identifying the gene product which is overexpressed in a strain which proliferated
more rapidly on said solid medium. 

368. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining an array of strains on a solid growth medium wherein each strain 
overexpresses a different gene product which is essential for proliferation of said organism,
wherein said culture comprises a strain in which a gene product comprising an amino acid 
sequence selected from the group consisting of SEQ ID NOs.: 42938-78581 is 
overexpressed; 

contacting said array of strains with a sufficient concentration of said compound to 
inhibit the proliferation of strains of said organism which do not overexpress said gene
product on which said compound acts, such that strains which overexpress said gene 
product on which said compound acts proliferate more rapidly than strains which do not 
overexpress said gene product on which said compound acts; and

identifying the gene product which is overexpressed in a strain which proliferated
more rapidly on said solid medium. 

369. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining an array of strains on a solid growth medium wherein each strain
overexpresses a different gene product which is essential for proliferation of said organism, 
wherein said culture comprises a strain in which a gene product selected from the group 
consisting of a gene product having at least 70% nucleotide sequence identity as determined 
using BLASTN version 2.0 with the default parameters to a gene product whose expression
is inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from the
group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid having 
at least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with 
the default parameters to a nucleic acid encoding a gene product whose expression is 
inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from the
group consisting of SEQ ID NOs: 1-6213, a gene product having at least 25% amino acid
identity as determined using FASTA version 3.0t78 with the default parameters to a gene 
product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product 
encoded by a nucleic acid which hybridizes to a nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 1-6213 under stringent
conditions, a gene product encoded by a nucleic acid which hybridizes to a nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1- 
6213 under moderate conditions, and a gene product whose activity may be complemented 
by the gene product whose activity is inhibited by a nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs: 1-6213 is overexpressed;

contacting said array of strains with a sufficient concentration of said compound to 
inhibit the proliferation of strains of said organism which do not overexpress said gene 
product on which said compound acts, such that strains which overexpress said gene 
product on which said compound acts proliferate more rapidly than strains which do not
overexpress said gene product on which said compound acts; and

identifying the gene product which is overexpressed in a strain which proliferated 
more rapidly on said solid medium. 

370. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining an array of strains on a solid growth medium wherein each strain
overexpresses a different gene product which is essential for proliferation of said organism, 
wherein said culture comprises a strain in which a gene product encoded by a nucleic acid 
comprising a nucleotide sequence selected from the group consisting of a nucleic acid 
comprising a nucleic acid having at least 70% nucleotide sequence identity as determined
using BLASTN version 2.0 with the default parameters to a nucleotide sequence selected 
from the group consisting of SEQ ID NOS.: 6214-42397, a nucleic acid comprising a 
nucleotide sequence which hybridizes to a sequence selected from the group consisting of 
SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic acid comprising a
nucleotide sequence which hybridizes to a nucleotide sequence selected from the group 
consisting of SEQ ID NOS.: 6214-42397 under moderate conditions is overexpressed; 

contacting said array of strains with a sufficient concentration of said compound to 
inhibit the proliferation of strains of said organism which do not overexpress said gene 
product on which said compound acts, such that strains which overexpress said gene
product on which said compound acts proliferate more rapidly than strains which do not 
overexpress said gene product on which said compound acts; and 

identifying the gene product which is overexpressed in a strain which proliferated 
more rapidly on said solid medium.

371. A method for identifying the gene product on which a compound which inhibits
proliferation of an organism acts comprising: 

obtaining an array of strains on a solid growth medium wherein each strain 
overexpresses a different gene product which is essential for proliferation of said organism, 
wherein said culture comprises a strain in which a gene product comprises a polypeptide 
selected from the group consisting of a polypeptide having at least 25% amino acid identity
as determined using FASTA version 3.0t78 to a polypeptide selected from the group 
consisting of SEQ ID NOs.: 42938-78581 and a polypeptide whose activity may be 
complemented by a polypeptide selected from the group consisting of SEQ ID NOs: 42938- 
78581 is overexpressed; 

contacting said array of strains with a sufficient concentration of said compound to
inhibit the proliferation of strains of said organism which do not overexpress said gene 
product on which said compound acts, such that strains which overexpress said gene 
product on which said compound acts proliferate more rapidly than strains which do not 
overexpress said gene product on which said compound acts; and 

identifying the gene product which is overexpressed in a strain which proliferated
more rapidly on said solid medium. 

372. The method of Paragraph 366, 367, 368, 369, 370 or 371, wherein at least one 
strain in said array does not overexpress a gene product which is essential for proliferation of said
organism. 

373. A method for identifying the gene product on which a compound which inhibits
proliferation of an organism acts comprising: 

obtaining a plurality of cultures, wherein each culture comprises a plurality of 
strains wherein each strain overexpresses a different gene product which is essential for 
proliferation of said organism, wherein said culture comprises a strain in which a gene
product whose activity or level is inhibited by a nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs.: 1-6213 is overexpressed; 

contacting each of said cultures with a different concentration of said compound; and

identifying the gene product which is overexpressed in a strain whose proliferation 
is inhibited by said compound. 
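One possible way to read out the concentration series of Paragraph 373 is sketched below. It assumes that growth of each strain, relative to an untreated control, has been recorded at every tested concentration, and it reports the lowest tested concentration at which each strain is inhibited. The data layout, the function names, and the 0.5 growth cutoff are hypothetical and serve only to illustrate the bookkeeping; the source does not prescribe them.

```python
from typing import Dict, Optional


def lowest_inhibitory_concentration(
    growth_by_conc: Dict[float, float],
    cutoff: float = 0.5,  # hypothetical cutoff: growth below half of control
) -> Optional[float]:
    """Return the lowest tested concentration at which relative growth < cutoff.

    `growth_by_conc` maps compound concentration to growth relative to an
    untreated control (1.0 means no inhibition). Returns None when the strain
    is not inhibited at any tested concentration.
    """
    inhibitory = [conc for conc, growth in growth_by_conc.items() if growth < cutoff]
    return min(inhibitory) if inhibitory else None


def inhibition_profile(
    data: Dict[str, Dict[float, float]],
    cutoff: float = 0.5,
) -> Dict[str, Optional[float]]:
    """Map each strain (keyed by its overexpressed gene product) to its result."""
    return {
        strain: lowest_inhibitory_concentration(series, cutoff)
        for strain, series in data.items()
    }


if __name__ == "__main__":
    measurements = {
        "geneA": {1.0: 0.9, 2.0: 0.4, 4.0: 0.1},
        "geneB": {1.0: 1.0, 2.0: 0.95, 4.0: 0.3},
        "geneC": {1.0: 0.8, 2.0: 0.7, 4.0: 0.6},
    }
    for strain, conc in inhibition_profile(measurements).items():
        print(strain, conc)
```

Comparing the per-strain values across the series shows which strains are inhibited at which concentrations; how those values are interpreted is left to the method as claimed.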

374. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a plurality of cultures, wherein each culture comprises a plurality of
strains wherein each strain overexpresses a different gene product which is essential for 
proliferation of said organism, wherein said culture comprises a strain in which a gene 
product encoded by a nucleic acid comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NOs.: 6214-42397 is overexpressed;

contacting each of said cultures with a different concentration of said compound; and

identifying the gene product which is overexpressed in a strain whose proliferation 
is inhibited by said compound. 

375. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising:

obtaining a plurality of cultures, wherein each culture comprises a plurality of 
strains wherein each strain overexpresses a different gene product which is essential for 
proliferation of said organism, wherein said culture comprises a strain in which a gene 
product comprising an amino acid sequence selected from the group consisting of SEQ ID 
NOs.: 42938-78581 is overexpressed;

contacting each of said cultures with a different concentration of said compound; and

identifying the gene product which is overexpressed in a strain whose proliferation 
is inhibited by said compound.

376. A method for identifying the gene product on which a compound which inhibits
proliferation of an organism acts comprising: 

obtaining a plurality of cultures, wherein each culture comprises a plurality of 
strains wherein each strain overexpresses a different gene product which is essential for
proliferation of said organism, wherein said culture comprises a strain in which a gene 
product selected from the group consisting of a gene product having at least 70% nucleotide
sequence identity as determined using BLASTN version 2.0 with the default parameters to 
a gene product whose expression is inhibited by an antisense nucleic acid comprising a 
nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene 
product encoded by a nucleic acid having at least 70% nucleotide sequence identity as
determined using BLASTN version 2.0 with the default parameters to a nucleic acid 
encoding a gene product whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1-6213,
a gene product having at least 25% amino acid identity as determined using FASTA
version 3.0t78 with the default parameters to a gene product whose expression is inhibited 
by an antisense nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid which 
hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group
consisting of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by
a nucleic acid which hybridizes to a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene
product whose activity may be complemented by the gene product whose activity is 
inhibited by a nucleic acid comprising a nucleotide sequence selected from the group
consisting of SEQ ID NOs: 1-6213 is overexpressed;

contacting each of said cultures with a different concentration of said compound; and

identifying the gene product which is overexpressed in a strain whose proliferation 
is inhibited by said compound.

377. A method for identifying the gene product on which a compound which inhibits
proliferation of an organism acts comprising: 

obtaining a plurality of cultures, wherein each culture comprises a plurality of 
strains wherein each strain overexpresses a different gene product which is essential for 
proliferation of said organism, wherein said culture comprises a strain in which a gene 
product encoded by a nucleic acid comprising a nucleotide sequence selected from the
group consisting of a nucleic acid comprising a nucleic acid having at least 70% nucleotide 
sequence identity as determined using BLASTN version 2.0 with the default parameters to 
a nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214-42397, a 
nucleic acid comprising a nucleotide sequence which hybridizes to a sequence selected 
from the group consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a
nucleic acid comprising a nucleotide sequence which hybridizes to a nucleotide sequence 
selected from the group consisting of SEQ ID NOS.: 6214-42397 under moderate 
conditions is overexpressed; 

contacting each of said cultures with a different concentration of said compound; and

identifying the gene product which is overexpressed in a strain whose proliferation 
is inhibited by said compound.

378. A method for identifying the gene product on which a compound which inhibits
proliferation of an organism acts comprising: 

obtaining a plurality of cultures, wherein each culture comprises a plurality of 
strains wherein each strain overexpresses a different gene product which is essential for 
proliferation of said organism, wherein said culture comprises a strain in which a gene
product comprises a polypeptide selected from the group consisting of a polypeptide having 
at least 25% amino acid identity as determined using FASTA version 3.0t78 to a 
polypeptide selected from the group consisting of SEQ ID NOs.: 42938-78581 and a 
polypeptide whose activity may be complemented by a polypeptide selected from the group 
consisting of SEQ ID NOs: 42938-78581 is overexpressed;

contacting each of said cultures with a different concentration of said compound; and

identifying the gene product which is overexpressed in a strain whose proliferation 
is inhibited by said compound.

379. The method of Paragraph 373, 374, 375, 376, 377 or 378, wherein at least one
strain in said plurality of cultures does not overexpress a gene product which is essential for 
proliferation of said organism. 

380. A method of profiling a compound's activity comprising: 

performing the method of Paragraph 346 on a first culture using a first compound;

performing the method of Paragraph 346 on a second culture using a second
compound; and 

comparing the strains identified in said first culture to the strains identified in said 
second culture. 
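The comparison step of Paragraph 380 can be sketched as a set comparison. The example assumes each run of the method yields a set of identifiers for the strains (keyed by overexpressed gene product) identified with a given compound. The function name and the use of a Jaccard index as a similarity score are illustrative assumptions; the source only calls for comparing the identified strains.

```python
from typing import Dict, Set, Union


def compare_profiles(
    first: Set[str],
    second: Set[str],
) -> Dict[str, Union[Set[str], float]]:
    """Compare strains identified with a first and a second compound.

    Returns the shared strains, the strains unique to each compound, and a
    Jaccard similarity (size of intersection divided by size of union).
    """
    shared = first & second
    union = first | second
    return {
        "shared": shared,
        "only_first": first - second,
        "only_second": second - first,
        # Two empty profiles are treated as identical by convention here.
        "jaccard": len(shared) / len(union) if union else 1.0,
    }


if __name__ == "__main__":
    first_hits = {"geneA", "geneB"}    # hypothetical result for the first compound
    second_hits = {"geneB", "geneC"}   # hypothetical result for the second compound
    result = compare_profiles(first_hits, second_hits)
    print("shared:", sorted(result["shared"]))
    print("jaccard:", round(result["jaccard"], 3))
```

A high overlap suggests the two compounds act on the same gene products, while disjoint sets suggest different targets; any threshold for "similar" would have to be chosen by the user.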

381. A method of profiling a compound's activity comprising: 

performing the method of Paragraph 347 on a first culture using a first compound;

performing the method of Paragraph 347 on a second culture using a second 
compound; and 

comparing the strains identified in said first culture to the strains identified in said 
second culture. 

382. A method of profiling a compound's activity comprising:

performing the method of Paragraph 348 on a first culture using a first compound; 
performing the method of Paragraph 348 on a second culture using a second 
compound; and 

comparing the strains identified in said first culture to the strains identified in said 
second culture.

383. A method of profiling a compound's activity comprising: 

performing the method of Paragraph 349 on a first culture using a first compound; 
performing the method of Paragraph 349 on a second culture using a second
compound; and 

comparing the strains identified in said first culture to the strains identified in said 
second culture. 

384. A method of profiling a compound's activity comprising:

performing the method of Paragraph 350 on a first culture using a first compound; 
performing the method of Paragraph 350 on a second culture using a second 
compound; and 

comparing the strains identified in said first culture to the strains identified in said 
second culture.

385. A method of profiling a compound's activity comprising:

performing the method of Paragraph 351 on a first culture using a first compound; 
performing the method of Paragraph 351 on a second culture using a second 
compound; and 

comparing the strains identified in said first culture to the strains identified in said
second culture. 

386. A method of profiling a first compound's activity comprising: 

growing an array of strains on a first solid medium comprising said first compound 
and on a second solid medium comprising a second compound, wherein each strain in said 

array overexpresses a different gene product which is essential for proliferation of an

organism, wherein said culture comprises a strain in which a gene product whose activity or 
level is inhibited by a nucleic acid comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NOs.: 1-6213 is overexpressed, and wherein said first 
compound and said second compound inhibit the proliferation of said organism; and 

comparing the pattern of strains which grow on said first solid medium with the

pattern of strains which grow on said second solid medium. 
387. A method of profiling a first compound's activity comprising:

growing an array of strains on a first solid medium comprising said first compound 
and on a second solid medium comprising a second compound, wherein each strain in said 

array overexpresses a different gene product which is essential for proliferation of an

organism, wherein said culture comprises a strain in which a gene product encoded by a 
nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ 
ID NOs.: 6214-42397 is overexpressed, and wherein said first compound and said second 
compound inhibit the proliferation of said organism; and 

comparing the pattern of strains which grow on said first solid medium with the

pattern of strains which grow on said second solid medium. 
388. A method of profiling a first compound's activity comprising: 






WO 02/077183 PCT/US02/09107 

growing an array of strains on a first solid medium comprising said first compound 
and on a second solid medium comprising a second compound, wherein each strain in said 
array overexpresses a different gene product which is essential for proliferation of an 
organism, wherein said culture comprises a strain in which a gene product comprising an 
amino acid sequence selected from the group consisting of SEQ ID NOs.: 42938-78581 is

overexpressed, and wherein said first compound and said second compound inhibit the 
proliferation of said organism; and 

comparing the pattern of strains which grow on said first solid medium with the 
pattern of strains which grow on said second solid medium. 

389. A method of profiling a first compound's activity comprising:

growing an array of strains on a first solid medium comprising said first compound 
and on a second solid medium comprising a second compound, wherein each strain in said 
array overexpresses a different gene product which is essential for proliferation of an 
organism, wherein said culture comprises a strain in which a gene product selected from the 

group consisting of a gene product having at least 70% nucleotide sequence identity as

determined using BLASTN version 2.0 with the default parameters to a gene product 
whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product 
encoded by a nucleic acid having at least 70% nucleotide sequence identity as determined 

using BLASTN version 2.0 with the default parameters to a nucleic acid encoding a gene
product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs: 1-6213, a gene product
having at least 25% amino acid identity as determined using FASTA version 3.0t78 with 
the default parameters to a gene product whose expression is inhibited by an antisense 

nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ

ID NOs.: 1-6213, a gene product encoded by a nucleic acid which hybridizes to a nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 
1-6213 under stringent conditions, a gene product encoded by a nucleic acid which 
hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group 

consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product whose

activity may be complemented by the gene product whose activity is inhibited by a nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs: 
1-6213 is overexpressed, and wherein said first compound and said second compound 
inhibit the proliferation of said organism; and 

comparing the pattern of strains which grow on said first solid medium with the

pattern of strains which grow on said second solid medium. 
390. A method of profiling a first compound's activity comprising:

growing an array of strains on a first solid medium comprising said first compound
and on a second solid medium comprising a second compound, wherein each strain in said 
array overexpresses a different gene product which is essential for proliferation of an 
organism, wherein said culture comprises a strain in which a gene product encoded by a 
nucleic acid comprising a nucleotide sequence selected from the group consisting of a

nucleic acid comprising a nucleic acid having at least 70% nucleotide sequence identity as 
determined using BLASTN version 2.0 with the default parameters to a nucleotide 
sequence selected from the group consisting of SEQ ID NOS.: 6214-42397, a nucleic acid 
comprising a nucleotide sequence which hybridizes to a sequence selected from the group 
consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic acid

comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected from 
the group consisting of SEQ ID NOS.: 6214-42397 under moderate conditions is 
overexpressed, and wherein said first compound and said second compound inhibit the 
proliferation of said organism; and 
comparing the pattern of strains which grow on said first solid medium with the

pattern of strains which grow on said second solid medium. 
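
The identity thresholds recited above (at least 70% nucleotide sequence identity, at least 25% amino acid identity) are stated to be determined with BLASTN version 2.0 or FASTA version 3.0t78. The Python sketch below does not reproduce either program. It only illustrates the arithmetic of a percent-identity figure for two sequences that have already been aligned, and the function name `percent_identity`, the gap symbol, and the choice of alignment length as denominator are assumptions made for this example. Alignment tools may define the denominator differently.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity of two pre-aligned sequences of equal length.

    Columns where both sequences carry a gap are ignored; a residue
    opposite a gap counts as a mismatch. These conventions are
    assumptions of this sketch, not BLASTN or FASTA behaviour.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == gap and y == gap)]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a, aligned_b, threshold):
    """True when the percent identity is at least `threshold` (e.g. 70 or 25)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```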
391. A method of profiling a first compound's activity comprising: 

growing an array of strains on a first solid medium comprising said first compound 
and on a second solid medium comprising a second compound, wherein each strain in said 
array overexpresses a different gene product which is essential for proliferation of an

organism, wherein said culture comprises a strain in which a gene product comprises a 
polypeptide selected from the group consisting of a polypeptide having at least 25% amino 
acid identity as determined using FASTA version 3.0t78 to a polypeptide selected from the 
group consisting of SEQ ID NOs.: 42938-78581 and a polypeptide whose activity may be 
complemented by a polypeptide selected from the group consisting of SEQ ID NOs: 42938-78581
is overexpressed, and wherein said first compound and said second compound
inhibit the proliferation of said organism; and 

comparing the pattern of strains which grow on said first solid medium with the
pattern of strains which grow on said second solid medium. 
392. The method of any one of Paragraphs 380, 381, 382, 383, 384, 385, 386, 387, 388,

389, 390 or 391, wherein said first compound is present in a crude or partially purified state. 
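
As a non-limiting illustration of the pattern comparison recited in Paragraphs 386 to 391, the growth of each arrayed strain on each solid medium can be recorded as a true/false value and the two records compared position by position. The Python sketch below assumes the same array is plated on both media. The function name `compare_growth_patterns` and the strain labels are hypothetical.

```python
def compare_growth_patterns(first_medium, second_medium):
    """Compare which arrayed strains grow on two solid media.

    Each argument maps a strain identifier to True (growth) or False
    (no growth). Returns two sorted lists: strains behaving the same
    on both media and strains behaving differently.
    """
    if first_medium.keys() != second_medium.keys():
        raise ValueError("both media must carry the same array of strains")
    same = sorted(s for s in first_medium
                  if first_medium[s] == second_medium[s])
    different = sorted(s for s in first_medium
                       if first_medium[s] != second_medium[s])
    return same, different
```

In this sketch, strains in the `different` list are the ones that distinguish the two compounds.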

393. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain 
underexpresses a different gene product which is essential for proliferation of said

organism, wherein said culture comprises a strain in which a gene product whose activity or 
level is inhibited by a nucleic acid comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NOs.: 1-6213 is underexpressed;

contacting said culture with a sufficient concentration of said compound to inhibit
the proliferation of strains of said organism which underexpress said gene product on which 
said compound acts, such that strains which underexpress said gene product on which said 
compound acts proliferate more slowly than strains which do not underexpress said gene 
product on which said compound acts; and

identifying the gene product which is underexpressed in a strain which proliferated 
more slowly in said culture. 

394. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain

underexpresses a different gene product which is essential for proliferation of said 
organism, wherein said culture comprises a strain in which a gene product encoded by a 
nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ 
ID NOs.: 6214-42397 is underexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit

the proliferation of strains of said organism which underexpress said gene product on which 
said compound acts, such that strains which underexpress said gene product on which said 
compound acts proliferate more slowly than strains which do not underexpress said gene 
product on which said compound acts; and 

identifying the gene product which is underexpressed in a strain which proliferated

more slowly in said culture. 

395. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain 
underexpresses a different gene product which is essential for proliferation of said

organism, wherein said culture comprises a strain in which a gene product comprising an 
amino acid sequence selected from the group consisting of SEQ ID NOs.: 42938-78581 is 
underexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit 
the proliferation of strains of said organism which underexpress said gene product on which

said compound acts, such that strains which underexpress said gene product on which said 
compound acts proliferate more slowly than strains which do not underexpress said gene 
product on which said compound acts; and 

identifying the gene product which is underexpressed in a strain which proliferated
more slowly in said culture.

396. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising:

obtaining a culture comprising a plurality of strains wherein each strain
underexpresses a different gene product which is essential for proliferation of said 
organism, wherein said culture comprises a strain in which a gene product selected from the 
group consisting of a gene product having at least 70% nucleotide sequence identity as 

determined using BLASTN version 2.0 with the default parameters to a gene product

whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product 
encoded by a nucleic acid having at least 70% nucleotide sequence identity as determined 
using BLASTN version 2.0 with the default parameters to a nucleic acid encoding a gene 

product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide

sequence selected from the group consisting of SEQ ID NOs: 1-6213, a gene product 
having at least 25% amino acid identity as determined using FASTA version 3.0t78 with 
the default parameters to a gene product whose expression is inhibited by an antisense 
nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ 

ID NOs.: 1-6213, a gene product encoded by a nucleic acid which hybridizes to a nucleic

acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 
1-6213 under stringent conditions, a gene product encoded by a nucleic acid which 
hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product whose 

activity may be complemented by the gene product whose activity is inhibited by a nucleic

acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs: 
1-6213 is underexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit 
the proliferation of strains of said organism which underexpress said gene product on which 

said compound acts, such that strains which underexpress said gene product on which said

compound acts proliferate more slowly than strains which do not underexpress said gene 
product on which said compound acts; and 

identifying the gene product which is underexpressed in a strain which proliferated 
more slowly in said culture. 

397. A method for identifying the gene product on which a compound which inhibits

proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain 
underexpresses a different gene product which is essential for proliferation of said 
organism, wherein said culture comprises a strain in which a gene product encoded by a 

nucleic acid comprising a nucleotide sequence selected from the group consisting of a

nucleic acid comprising a nucleic acid having at least 70% nucleotide sequence identity as 
determined using BLASTN version 2.0 with the default parameters to a nucleotide 
sequence selected from the group consisting of SEQ ID NOS.: 6214-42397, a nucleic acid
comprising a nucleotide sequence which hybridizes to a sequence selected from the group
consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic acid 
comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected from 
the group consisting of SEQ ID NOS.: 6214-42397 under moderate conditions is 
underexpressed;

contacting said culture with a sufficient concentration of said compound to inhibit 
the proliferation of strains of said organism which underexpress said gene product on which 
said compound acts, such that strains which underexpress said gene product on which said 
compound acts proliferate more slowly than strains which do not underexpress said gene 
product on which said compound acts; and

identifying the gene product which is underexpressed in a strain which proliferated 
more slowly in said culture. 

398. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 
obtaining a culture comprising a plurality of strains wherein each strain

underexpresses a different gene product which is essential for proliferation of said 
organism, wherein said culture comprises a strain in which a gene product comprises a 
polypeptide selected from the group consisting of a polypeptide having at least 25% amino 
acid identity as determined using FASTA version 3.0t78 to a polypeptide selected from the 
group consisting of SEQ ID NOs.: 42938-78581 and a polypeptide whose activity may be
complemented by a polypeptide selected from the group consisting of SEQ ID NOs: 42938-78581
is underexpressed;

contacting said culture with a sufficient concentration of said compound to inhibit 
the proliferation of strains of said organism which underexpress said gene product on which 
said compound acts, such that strains which underexpress said gene product on which said

compound acts proliferate more slowly than strains which do not underexpress said gene 
product on which said compound acts; and 

identifying the gene product which is underexpressed in a strain which proliferated 
more slowly in said culture. 
399. The method of Paragraph 393, 394, 395, 396, 397 or 398, wherein at least one
strain in said culture does not underexpress a gene product which is essential for proliferation of
said organism. 

400. The method of Paragraph 393, 394, 395, 396, 397 or 398, wherein said strains 
which underexpress said gene products comprise a nucleic acid complementary to at least a
portion of a gene encoding said gene product which is essential for proliferation of said organism
operably linked to a regulatable promoter. 

401. The method of Paragraph 393, 394, 395, 396, 397 or 398, wherein said strains 
which underexpress said gene products express an antisense nucleic acid complementary to at least
a portion of a gene encoding said gene product which is essential for proliferation of said organism,
wherein expression of said antisense nucleic acid reduces expression of said gene product in said 
strain. 

402. The method of Paragraph 393, 394, 395, 396, 397 or 398, wherein said 
identification step comprises determining the nucleotide sequence of a nucleic acid encoding said

gene product in said strain which proliferated more slowly. 

403. The method of Paragraph 393, 394, 395, 396, 397 or 398, wherein said 
identification step comprises performing an amplification reaction to identify the nucleic acid 
encoding said gene product in said cell which proliferated more slowly. 

404. The method of Paragraph 393, 394, 395, 396, 397 or 398, wherein the products of

said amplification reaction are labeled with a detectable dye. 

405. The method of Paragraph 404, wherein said identification step comprises 
performing a hybridization procedure. 

406. The method of Paragraph 393, 394, 395, 396, 397 or 398, wherein said
identification step comprises contacting a nucleic acid array with a nucleic acid encoding said gene

product in said cell which proliferated more slowly. 

407. The method of Paragraph 393, 394, 395, 396, 397 or 398, wherein said organism is 
selected from the group consisting of bacteria, fungi, and protozoa.

408. The method of Paragraph 393, 394, 395, 396, 397 or 398, wherein said culture is a 
culture of an organism selected from the group consisting of Acinetobacter baumannii, Anaplasma
marginale, Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis,
Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei,
Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata),
Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr
(also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia
trachomatis, Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium
perfringens, Coccidioides immitis, Corynebacterium diphtheriae, Cryptococcus neoformans,
Enterobacter cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli,
Haemophilus influenzae, Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae,
Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium,
Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia
asteroides, Pasteurella haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus
mirabilis, Proteus vulgaris, Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas
syringae, Salmonella bongori, Salmonella choleraesuis, Salmonella enterica, Salmonella paratyphi,
Salmonella typhi, Salmonella typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri,
Shigella sonnei, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus,
Streptococcus pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum,
Ureaplasma urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia
enterocolitica, Yersinia pestis and any species falling within the genera of any of the above species.

409. The method of Paragraph 393, 394, 395, 396, 397 or 398, wherein said compound 
is obtained from a library of natural compounds. 
410. The method of Paragraph 393, 394, 395, 396, 397 or 398, wherein said compound

is obtained from a library of synthetic compounds. 

411. The method of Paragraph 393, 394, 395, 396, 397 or 398, wherein said compound 
is present in a crude or partially purified state. 

412. The method of Paragraph 393, 394, 395, 396, 397 or 398, further comprising 
determining whether said gene product in said strain which proliferated more slowly in said culture

has a counterpart in at least one other organism. 
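
The identifying step of Paragraphs 393 to 398 turns on finding a strain that proliferates more slowly than the other strains in the presence of the compound. One way to picture this, offered only as a sketch, is to compare each strain's growth with and without the compound and to flag strains whose relative growth falls well below that of the culture as a whole. The function name `slower_strains`, the growth readout (for example an optical density or a relative abundance), and the `cutoff` value are assumptions introduced here and do not come from the text.

```python
from statistics import median


def slower_strains(untreated, treated, cutoff=0.5):
    """Flag strains that proliferate more slowly in the presence of a compound.

    `untreated` and `treated` map strain identifiers to a growth readout
    measured without and with the compound; untreated values must be
    non-zero. A strain is flagged when its treated/untreated ratio is
    below `cutoff` times the median ratio of all strains (an assumed,
    arbitrary criterion).
    """
    ratios = {s: treated[s] / untreated[s] for s in untreated}
    reference = median(ratios.values())
    return sorted(s for s, r in ratios.items() if r < cutoff * reference)
```

The gene product underexpressed in a flagged strain would then be a candidate for the gene product on which the compound acts, subject to confirmation by the identification steps recited in Paragraphs 402 to 406.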

413. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a plurality of cultures, each culture comprising a plurality of strains 
wherein each strain underexpresses a different gene product which is essential for

proliferation of said organism, wherein said culture comprises a strain in which a gene 
product whose activity or level is inhibited by a nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs.: 1-6213 is underexpressed;

contacting each of said cultures with a different concentration of said compound; and

identifying the gene product which is underexpressed in a strain whose rate of 
proliferation is reduced by said compound. 

414. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a plurality of cultures, each culture comprising a plurality of strains

wherein each strain underexpresses a different gene product which is essential for 
proliferation of said organism, wherein said culture comprises a strain in which a gene 
product encoded by a nucleic acid comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NOs.: 6214-42397 is underexpressed;

contacting each of said cultures with a different concentration of said compound; and

identifying the gene product which is underexpressed in a strain whose rate of 
proliferation is reduced by said compound. 

415. A method for identifying the gene product on which a compound which inhibits
proliferation of an organism acts comprising:

obtaining a plurality of cultures, each culture comprising a plurality of strains 
wherein each strain underexpresses a different gene product which is essential for 
proliferation of said organism, wherein said culture comprises a strain in which a gene
product comprising an amino acid sequence selected from the group consisting of SEQ ID
NOs.: 42938-78581 is underexpressed; 

contacting each of said cultures with a different concentration of said compound; 

and 

identifying the gene product which is underexpressed in a strain whose rate of

proliferation is reduced by said compound. 

416. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a plurality of cultures, each culture comprising a plurality of strains 

wherein each strain underexpresses a different gene product which is essential for

proliferation of said organism, wherein said culture comprises a strain in which a gene 
product selected from the group consisting of a gene product having at least 70% nucleotide 
sequence identity as determined using BLASTN version 2.0 with the default parameters to 
a gene product whose expression is inhibited by an antisense nucleic acid comprising a 

nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene

product encoded by a nucleic acid having at least 70% nucleotide sequence identity as 
determined using BLASTN version 2.0 with the default parameters to a nucleic acid 
encoding a gene product whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1-6213,
a gene product having at least 25% amino acid identity as determined using FASTA

version 3.0t78 with the default parameters to a gene product whose expression is inhibited 
by an antisense nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid which 
hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group 

consisting of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by

a nucleic acid which hybridizes to a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene 
product whose activity may be complemented by the gene product whose activity is 
inhibited by a nucleic acid comprising a nucleotide sequence selected from the group 

consisting of SEQ ID NOs: 1-6213 is underexpressed;

contacting each of said cultures with a different concentration of said compound; 

and 

identifying the gene product which is underexpressed in a strain whose rate of 
proliferation is reduced by said compound. 
417. A method for identifying the gene product on which a compound which inhibits

proliferation of an organism acts comprising: 

obtaining a plurality of cultures, each culture comprising a plurality of strains 
wherein each strain underexpresses a different gene product which is essential for
proliferation of said organism, wherein said culture comprises a strain in which a gene
product encoded by a nucleic acid comprising a nucleotide sequence selected from the 
group consisting of a nucleic acid comprising a nucleic acid having at least 70% nucleotide 
sequence identity as determined using BLASTN version 2.0 with the default parameters to 

a nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214-42397, a

nucleic acid comprising a nucleotide sequence which hybridizes to a sequence selected 
from the group consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a 
nucleic acid comprising a nucleotide sequence which hybridizes to a nucleotide sequence 
selected from the group consisting of SEQ ID NOS.: 6214-42397 under moderate 

conditions is underexpressed;

contacting each of said cultures with a different concentration of said compound; 

and 

identifying the gene product which is underexpressed in a strain whose rate of 
proliferation is reduced by said compound. 
418. A method for identifying the gene product on which a compound which inhibits

proliferation of an organism acts comprising: 

obtaining a plurality of cultures, each culture comprising a plurality of strains 
wherein each strain underexpresses a different gene product which is essential for 
proliferation of said organism, wherein said culture comprises a strain in which a gene 
product comprises a polypeptide selected from the group consisting of a polypeptide having

at least 25% amino acid identity as determined using FASTA version 3.0t78 to a 
polypeptide selected from the group consisting of SEQ ID NOs.: 42938-78581 and a 
polypeptide whose activity may be complemented by a polypeptide selected from the group 
consisting of SEQ ID NOs: 42938-78581 is underexpressed; 
contacting each of said cultures with a different concentration of said compound;

and 

identifying the gene product which is underexpressed in a strain whose rate of 
proliferation is reduced by said compound. 
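
Paragraphs 413 to 418 recite contacting several cultures with different concentrations of the compound. As an illustration only, the data from such an experiment can be reduced to the lowest tested concentration at which each strain's growth drops below a chosen fraction of its growth without compound. In the Python sketch below, the function name `lowest_inhibiting_concentration`, the layout of the input data, and the `fraction` parameter are hypothetical and are not drawn from the text.

```python
def lowest_inhibiting_concentration(growth_by_conc, control, fraction=0.5):
    """Lowest tested concentration at which each strain is inhibited.

    `growth_by_conc` maps a concentration to {strain: growth readout};
    `control` maps each strain to its growth without compound.
    A strain counts as inhibited when its growth is below
    `fraction` times its control value (an assumed criterion).
    Strains never inhibited at any tested concentration are omitted.
    """
    result = {}
    for conc in sorted(growth_by_conc):
        for strain, growth in growth_by_conc[conc].items():
            if strain not in result and growth < fraction * control[strain]:
                result[strain] = conc
    return result
```

Under these assumptions, the strain inhibited at the lowest concentration is the one most sensitized to the compound, and the gene product it underexpresses is a candidate for further identification.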
419. A method of profiling a compound's activity comprising: 
performing the method of Paragraph 393 on a first culture using a first compound;

performing the method of Paragraph 393 on a second culture using a second 
compound; and 

comparing the strains identified in said first culture to the strains identified in said 
second culture. 

420. A method of profiling a compound's activity comprising:

performing the method of Paragraph 394 on a first culture using a first compound; 
performing the method of Paragraph 394 on a second culture using a second 
compound; and

comparing the strains identified in said first culture to the strains identified in said
second culture. 

421. A method of profiling a compound's activity comprising:

performing the method of Paragraph 395 on a first culture using a first compound; 
performing the method of Paragraph 395 on a second culture using a second

compound; and 

comparing the strains identified in said first culture to the strains identified in said 
second culture. 

422. A method of profiling a compound's activity comprising:

performing the method of Paragraph 396 on a first culture using a first compound;

performing the method of Paragraph 396 on a second culture using a second
compound; and 

comparing the strains identified in said first culture to the strains identified in said 
second culture. 

423. A method of profiling a compound's activity comprising:

performing the method of Paragraph 397 on a first culture using a first compound; 
performing the method of Paragraph 397 on a second culture using a second 
compound; and 

comparing the strains identified in said first culture to the strains identified in said 
second culture.

424. A method of profiling a compound's activity comprising:

performing the method of Paragraph 398 on a first culture using a first compound; 
performing the method of Paragraph 398 on a second culture using a second 
compound; and 

comparing the strains identified in said first culture to the strains identified in said

second culture. 

425. A method of profiling a first compound's activity comprising: 

growing an array of strains on a first solid medium comprising said first compound 
and on a second solid medium comprising a second compound, wherein said array 

comprises a plurality of strains wherein each strain underexpresses a different gene product

which is essential for proliferation of an organism, wherein said culture comprises a strain 
in which a gene product whose activity or level is inhibited by a nucleic acid comprising a 
nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213 is 
underexpressed, and wherein said first compound and said second compound inhibit the 

proliferation of said organism; and

comparing the pattern of strains which grow on said first solid medium with the 
pattern of strains which grow on said second solid medium. 

426. A method of profiling a first compound's activity comprising:

growing an array of strains on a first solid medium comprising said first compound
and on a second solid medium comprising a second compound, wherein said array 
comprises a plurality of strains wherein each strain underexpresses a different gene product 
which is essential for proliferation of an organism, wherein said culture comprises a strain 
in which a gene product encoded by a nucleic acid comprising a nucleotide sequence

selected from the group consisting of SEQ ID NOs.: 6214-42397 is underexpressed, and 
wherein said first compound and said second compound inhibit the proliferation of said 
organism; and 

comparing the pattern of strains which grow on said first solid medium with the 
pattern of strains which grow on said second solid medium.

427. A method of profiling a first compound's activity comprising: 

growing an array of strains on a first solid medium comprising said first compound 
and on a second solid medium comprising a second compound, wherein said array 
comprises a plurality of strains wherein each strain underexpresses a different gene product 
which is essential for proliferation of an organism, wherein said culture comprises a strain
in which a gene product comprising an amino acid sequence selected from the group
consisting of SEQ ID NOs.: 42938-78581 is underexpressed, and wherein said first 
compound and said second compound inhibit the proliferation of said organism; and 

comparing the pattern of strains which grow on said first solid medium with the 
pattern of strains which grow on said second solid medium.

428. A method of profiling a first compound's activity comprising: 

growing an array of strains on a first solid medium comprising said first compound 
and on a second solid medium comprising a second compound, wherein said array 
comprises a plurality of strains wherein each strain underexpresses a different gene product
which is essential for proliferation of an organism, wherein said culture comprises a strain
in which a gene product selected from the group consisting of a gene product having at
least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the 
default parameters to a gene product whose expression is inhibited by an antisense nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.:
1-6213, a gene product encoded by a nucleic acid having at least 70% nucleotide sequence
identity as determined using BLASTN version 2.0 with the default parameters to a nucleic
acid encoding a gene product whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213,
a gene product having at least 25% amino acid identity as determined using FASTA
version 3.0t78 with the default parameters to a gene product whose expression is inhibited
by an antisense nucleic acid comprising a nucleotide sequence selected from the group
consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid which 
hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group
consisting of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by
a nucleic acid which hybridizes to a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene 
product whose activity may be complemented by the gene product whose activity is
inhibited by a nucleic acid comprising a nucleotide sequence selected from the group
consisting of SEQ ID NOs.: 1-6213 is underexpressed, and wherein said first compound
and said second compound inhibit the proliferation of said organism; and 

comparing the pattern of strains which grow on said first solid medium with the 
pattern of strains which grow on said second solid medium. 
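
As a rough illustration of the percent identity thresholds recited above (at least 70% nucleotide identity, at least 25% amino acid identity), the following sketch computes identity over two sequences that have already been aligned to equal length, with "-" marking gaps. It is not BLASTN or FASTA and does not reproduce their scoring or default parameters; the function names and the convention of dividing matches by alignment length are assumptions made for this example.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both sequences.

    Both inputs must be pre-aligned to equal length; '-' denotes a gap and
    never counts as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        return 0.0
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def meets_identity_threshold(aligned_a: str, aligned_b: str, threshold: float) -> bool:
    """True when the pair reaches the stated threshold, e.g. 70.0 or 25.0."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

In practice the alignment itself would come from the program named in the paragraph; this sketch only shows how a threshold could be applied to its output.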

429. A method of profiling a first compound's activity comprising:

growing an array of strains on a first solid medium comprising said first compound 
and on a second solid medium comprising a second compound, wherein said array 
comprises a plurality of strains wherein each strain underexpresses a different gene product 
which is essential for proliferation of an organism, wherein said culture comprises a strain
in which a gene product encoded by a nucleic acid comprising a nucleotide sequence
selected from the group consisting of a nucleic acid comprising a nucleic acid having at
least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the 
default parameters to a nucleotide sequence selected from the group consisting of SEQ ID 
NOS.: 6214-42397, a nucleic acid comprising a nucleotide sequence which hybridizes to a
sequence selected from the group consisting of SEQ ID NOS.: 6214-42397 under stringent
conditions, and a nucleic acid comprising a nucleotide sequence which hybridizes to a
nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214-42397 
under moderate conditions is underexpressed, and wherein said first compound and said 
second compound inhibit the proliferation of said organism; and 

comparing the pattern of strains which grow on said first solid medium with the
pattern of strains which grow on said second solid medium.

430. A method of profiling a first compound's activity comprising:

growing an array of strains on a first solid medium comprising said first compound 
and on a second solid medium comprising a second compound, wherein said array
comprises a plurality of strains wherein each strain underexpresses a different gene product
which is essential for proliferation of an organism, wherein said culture comprises a strain
in which a gene product comprises a polypeptide selected from the group consisting of a 
polypeptide having at least 25% amino acid identity as determined using FASTA version 
3.0t78 to a polypeptide selected from the group consisting of SEQ ID NOs.: 42938-78581
and a polypeptide whose activity may be complemented by a polypeptide selected from the
group consisting of SEQ ID NOs.: 42938-78581 is underexpressed, and wherein said first
compound and said second compound inhibit the proliferation of said organism; and

comparing the pattern of strains which grow on said first solid medium with the
pattern of strains which grow on said second solid medium. 

431. The method of any one of Paragraphs 419, 420, 421, 422, 423, 424, 425, 426, 427, 
428, 429 or 430, wherein said first compound is present in a crude or partially purified state.

432. A method for identifying the gene product on which a compound which inhibits
proliferation of an organism acts comprising:

obtaining a plurality of cultures comprising a plurality of strains wherein each 
strain underexpresses a different gene product which is essential for proliferation of said 
organism, wherein said culture comprises a strain in which a gene product whose activity or
level is inhibited by a nucleic acid comprising a nucleotide sequence selected from the
group consisting of SEQ ID NOs.: 1-6213 is underexpressed;

contacting each of said plurality of cultures with a varying concentration of a 
regulatory agent which regulates the level of expression of said gene products which are 
essential for proliferation of said organism; and

identifying the gene product which is underexpressed in a strain whose rate of
proliferation is reduced by said compound.
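
For illustration only, the identifying step above could be sketched as follows, assuming that a proliferation rate has been measured for each strain with and without the compound at a single level of the regulatory agent. The function name sensitized_strains, the dictionary layout, and the 0.5 cut-off are hypothetical and are not drawn from the specification.

```python
from typing import Dict, List, Tuple


def sensitized_strains(
    rate_without: Dict[str, float],
    rate_with: Dict[str, float],
    max_ratio: float = 0.5,
) -> List[Tuple[str, float]]:
    """List strains whose proliferation rate drops in the presence of the compound.

    Each key names a strain (and so the gene product it underexpresses).
    A strain is reported when rate_with / rate_without <= max_ratio.
    """
    hits = []
    for strain, base_rate in rate_without.items():
        if base_rate <= 0 or strain not in rate_with:
            continue  # no usable baseline or no measurement with the compound
        ratio = rate_with[strain] / base_rate
        if ratio <= max_ratio:
            hits.append((strain, ratio))
    # Most strongly affected strains first.
    return sorted(hits, key=lambda item: item[1])
```

The gene product underexpressed in the top-ranked strain would then be a candidate for the gene product on which the compound acts.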

433. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a plurality of cultures comprising a plurality of strains wherein each
strain underexpresses a different gene product which is essential for proliferation of said
organism, wherein said culture comprises a strain in which a gene product encoded by a
nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ 
ID NOs.: 6214-42397 is underexpressed;

contacting each of said plurality of cultures with a varying concentration of a
regulatory agent which regulates the level of expression of said gene products which are
essential for proliferation of said organism; and

identifying the gene product which is underexpressed in a strain whose rate of
proliferation is reduced by said compound. 

434. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising:

obtaining a plurality of cultures comprising a plurality of strains wherein each 
strain underexpresses a different gene product which is essential for proliferation of said 
organism, wherein said culture comprises a strain in which a gene product comprising an 
amino acid sequence selected from the group consisting of SEQ ID NOs.: 42938-78581 is 
underexpressed;

contacting each of said plurality of cultures with a varying concentration of a 
regulatory agent which regulates the level of expression of said gene products which are 
essential for proliferation of said organism; and

identifying the gene product which is underexpressed in a strain whose rate of
proliferation is reduced by said compound. 

435. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a plurality of cultures comprising a plurality of strains wherein each
strain underexpresses a different gene product which is essential for proliferation of said
organism, wherein said culture comprises a strain in which a gene product selected from the 
group consisting of a gene product having at least 70% nucleotide sequence identity as 
determined using BLASTN version 2.0 with the default parameters to a gene product
whose expression is inhibited by an antisense nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product
encoded by a nucleic acid having at least 70% nucleotide sequence identity as determined 
using BLASTN version 2.0 with the default parameters to a nucleic acid encoding a gene 
product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product
having at least 25% amino acid identity as determined using FASTA version 3.0t78 with
the default parameters to a gene product whose expression is inhibited by an antisense 
nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ 
ID NOs.: 1-6213, a gene product encoded by a nucleic acid which hybridizes to a nucleic
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.:
1-6213 under stringent conditions, a gene product encoded by a nucleic acid which
hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product whose 
activity may be complemented by the gene product whose activity is inhibited by a nucleic
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.:
1-6213 is underexpressed;

contacting each of said plurality of cultures with a varying concentration of a 
regulatory agent which regulates the level of expression of said gene products which are 
essential for proliferation of said organism; and 

identifying the gene product which is underexpressed in a strain whose rate of
proliferation is reduced by said compound.

436. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a plurality of cultures comprising a plurality of strains wherein each 
strain underexpresses a different gene product which is essential for proliferation of said
organism, wherein said culture comprises a strain in which a gene product encoded by a
nucleic acid comprising a nucleotide sequence selected from the group consisting of a 
nucleic acid comprising a nucleic acid having at least 70% nucleotide sequence identity as
determined using BLASTN version 2.0 with the default parameters to a nucleotide
sequence selected from the group consisting of SEQ ID NOS.: 6214-42397, a nucleic acid
comprising a nucleotide sequence which hybridizes to a sequence selected from the group 
consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic acid 
comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected from
the group consisting of SEQ ID NOS.: 6214-42397 under moderate conditions is
underexpressed; 

contacting each of said plurality of cultures with a varying concentration of a 
regulatory agent which regulates the level of expression of said gene products which are 
essential for proliferation of said organism; and

identifying the gene product which is underexpressed in a strain whose rate of 
proliferation is reduced by said compound. 

437. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a plurality of cultures comprising a plurality of strains wherein each
strain underexpresses a different gene product which is essential for proliferation of said
organism, wherein said culture comprises a strain in which a gene product comprises a 
polypeptide selected from the group consisting of a polypeptide having at least 25% amino 
acid identity as determined using FASTA version 3.0t78 to a polypeptide selected from the
group consisting of SEQ ID NOs.: 42938-78581 and a polypeptide whose activity may be
complemented by a polypeptide selected from the group consisting of SEQ ID NOs.:
42938-78581 is underexpressed;

contacting each of said plurality of cultures with a varying concentration of a 
regulatory agent which regulates the level of expression of said gene products which are
essential for proliferation of said organism; and

identifying the gene product which is underexpressed in a strain whose rate of 
proliferation is reduced by said compound. 

438. A culture comprising a plurality of strains wherein each strain overexpresses a 
different gene product which is essential for proliferation of said organism, wherein said culture
comprises a strain in which a gene product whose activity or level is inhibited by a nucleic acid
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213 is 
overexpressed. 

439. A culture comprising a plurality of strains wherein each strain overexpresses a 
different gene product which is essential for proliferation of said organism, wherein said culture
comprises a strain in which a gene product encoded by a nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 6214-42397 is overexpressed. 

440. A culture comprising a plurality of strains wherein each strain overexpresses a 
different gene product which is essential for proliferation of said organism, wherein said culture
comprises a strain in which a gene product comprising an amino acid sequence selected from the
group consisting of SEQ ID NOs.: 42938-78581 is overexpressed. 

441. A culture comprising a plurality of strains wherein each strain overexpresses a 
different gene product which is essential for proliferation of said organism, wherein said culture
comprises a strain in which a gene product selected from the group consisting of a gene product
having at least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the 
default parameters to a gene product whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a 
gene product encoded by a nucleic acid having at least 70% nucleotide sequence identity as
determined using BLASTN version 2.0 with the default parameters to a nucleic acid encoding a
gene product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs: 1-6213, a gene product having at least 
25% amino acid identity as determined using FASTA version 3.0t78 with the default parameters to 
a gene product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a
nucleic acid which hybridizes to a nucleic acid comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by a 
nucleic acid which hybridizes to a nucleic acid comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product whose
activity may be complemented by the gene product whose activity is inhibited by a nucleic acid
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1-6213 is 
overexpressed. 

442. A culture comprising a plurality of strains wherein each strain overexpresses a 
different gene product which is essential for proliferation of said organism, wherein said culture
comprises a strain in which a gene product encoded by a nucleic acid comprising a nucleotide
sequence selected from the group consisting of a nucleic acid comprising a nucleic acid having at 
least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the default 
parameters to a nucleotide sequence selected from the group consisting of SEQ ID NOS.:
6214-42397, a nucleic acid comprising a nucleotide sequence which hybridizes to a sequence selected
from the group consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic
acid comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected from the 
group consisting of SEQ ID NOS.: 6214-42397 under moderate conditions is overexpressed. 

443. A culture comprising a plurality of strains wherein each strain overexpresses a 
different gene product which is essential for proliferation of said organism, wherein said culture
comprises a strain in which a gene product comprises a polypeptide selected from the group
consisting of a polypeptide having at least 25% amino acid identity as determined using FASTA 
version 3.0t78 to a polypeptide selected from the group consisting of SEQ ID NOs.: 42938-78581
and a polypeptide whose activity may be complemented by a polypeptide selected from the group
consisting of SEQ ID NOs.: 42938-78581 is overexpressed.

444. The culture of Paragraph 438, 439, 440, 441, 442 or 443, wherein said strains 
which overexpress said gene products comprise a nucleic acid encoding said gene product which
is essential for proliferation of said organism operably linked to a regulatable promoter.

445. The culture of Paragraph 438, 439, 440, 441, 442 or 443, wherein said strains 
which overexpress said gene products comprise a nucleic acid encoding said gene product which
is essential for proliferation of said organism operably linked to a constitutive promoter. 

446. The culture of Paragraph 438, 439, 440, 441, 442 or 443, wherein said culture is a 
culture of an organism selected from the group consisting of Acinetobacter baumannii, Anaplasma
marginale, Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis,
Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei,
Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata),
Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr
(also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia
trachomatis, Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium
perfringens, Coccidioides immitis, Corynebacterium diphtheriae, Cryptococcus neoformans,
Enterobacter cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli,
Haemophilus influenzae, Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae,
Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium,
Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia
asteroides, Pasteurella haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus
mirabilis, Proteus vulgaris, Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas
syringae, Salmonella bongori, Salmonella choleraesuis, Salmonella enterica, Salmonella paratyphi,
Salmonella typhi, Salmonella typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri,
Shigella sonnei, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus,
Streptococcus pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum,
Ureaplasma urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia
enterocolitica, Yersinia pestis and any species falling within the genera of any of the above species.

447. A culture comprising a plurality of strains wherein each strain underexpresses a
different gene product which is essential for proliferation of said organism, wherein said culture 
comprises a strain in which a gene product whose activity or level is inhibited by a nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213 is
underexpressed.

448. A culture comprising a plurality of strains wherein each strain underexpresses a
different gene product which is essential for proliferation of said organism, wherein said culture
comprises a strain in which a gene product encoded by a nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 6214-42397 is underexpressed. 

449. A culture comprising a plurality of strains wherein each strain underexpresses a
different gene product which is essential for proliferation of said organism, wherein said culture
comprises a strain in which a gene product comprising an amino acid sequence selected from the
group consisting of SEQ ID NOs.: 42938-78581 is underexpressed. 

450. A culture comprising a plurality of strains wherein each strain underexpresses a
different gene product which is essential for proliferation of said organism, wherein said culture
comprises a strain in which a gene product selected from the group consisting of a gene product
having at least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the
default parameters to a gene product whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a 
gene product encoded by a nucleic acid having at least 70% nucleotide sequence identity as 
determined using BLASTN version 2.0 with the default parameters to a nucleic acid encoding a
gene product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs: 1-6213, a gene product having at least 
25% amino acid identity as determined using FASTA version 3.0t78 with the default parameters to 
a gene product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a
nucleic acid which hybridizes to a nucleic acid comprising a nucleotide sequence selected from the
group consisting of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by a 
nucleic acid which hybridizes to a nucleic acid comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product whose 
activity may be complemented by the gene product whose activity is inhibited by a nucleic acid
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213 is
underexpressed. 

451. A culture comprising a plurality of strains wherein each strain underexpresses a
different gene product which is essential for proliferation of said organism, wherein said culture
comprises a strain in which a gene product encoded by a nucleic acid comprising a nucleotide
sequence selected from the group consisting of a nucleic acid comprising a nucleic acid having at
least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the default 
parameters to a nucleotide sequence selected from the group consisting of SEQ ID NOS.:
6214-42397, a nucleic acid comprising a nucleotide sequence which hybridizes to a sequence selected
from the group consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic
acid comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected from the
group consisting of SEQ ID NOS.: 6214-42397 under moderate conditions is underexpressed. 

452. A culture comprising a plurality of strains wherein each strain underexpresses a
different gene product which is essential for proliferation of said organism, wherein said culture
comprises a strain in which a gene product comprises a polypeptide selected from the group
consisting of a polypeptide having at least 25% amino acid identity as determined using FASTA 
version 3.0t78 to a polypeptide selected from the group consisting of SEQ ID NOs.: 42938-78581 
and a polypeptide whose activity may be complemented by a polypeptide selected from the group 
consisting of SEQ ID NOs.: 42938-78581 is underexpressed.

453. The culture of Paragraph 447, 448, 449, 450, 451 or 452, wherein said strains 
which underexpress said gene products comprise a nucleic acid encoding said gene product which 
is essential for proliferation of said organism operably linked to a regulatable promoter. 

454. The culture of Paragraph 447, 448, 449, 450, 451 or 452, wherein said strains 
which underexpress said gene products comprise a nucleic acid encoding said gene product which
is essential for proliferation of said organism operably linked to a constitutive promoter.

455. The culture of Paragraph 447, 448, 449, 450, 451 or 452, wherein said culture is a 
culture of an organism selected from the group consisting of Acinetobacter baumannii, Anaplasma
marginale, Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis,
Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei,
Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata),
Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr
(also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia
trachomatis, Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium
perfringens, Coccidioides immitis, Corynebacterium diphtheriae, Cryptococcus neoformans,
Enterobacter cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli,
Haemophilus influenzae, Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae,
Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium,
Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia
asteroides, Pasteurella haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus
mirabilis, Proteus vulgaris, Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas
syringae, Salmonella bongori, Salmonella choleraesuis, Salmonella enterica, Salmonella paratyphi,
Salmonella typhi, Salmonella typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri,
Shigella sonnei, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus,
Streptococcus pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum,
Ureaplasma urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia
enterocolitica, Yersinia pestis and any species falling within the genera of any of the above species.

456. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising:

obtaining a culture comprising a plurality of strains wherein each strain 
overexpresses a different gene product which is essential for proliferation of said organism 
and wherein the nucleotide sequence of each of the overexpressed genes has been altered so
as to include a nucleotide sequence which can be used to generate a unique product
corresponding to each of the overexpressed genes, wherein said culture comprises a strain 
in which a gene product whose activity or level is inhibited by a nucleic acid comprising a 
nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213 is 
overexpressed;

contacting said culture with a sufficient concentration of said compound to inhibit 
the proliferation of strains of said organism which do not overexpress said gene product on 
which said compound acts, such that strains which overexpress said gene product on which 
said compound acts proliferate more rapidly than strains which do not overexpress said 
gene product on which said compound acts; and

identifying the gene product which is overexpressed in a strain which proliferated 
more rapidly in said culture by detecting the unique product corresponding to said gene. 
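
As an illustrative sketch only, detection of the unique product could be treated as counting a tag for each strain before and after contact with the compound and ranking tags by how much their share of the culture increased. The function name tag_enrichment and the count-based input format are assumptions made for this example; the specification does not prescribe a particular read-out.

```python
from typing import Dict, List, Tuple


def tag_enrichment(before: Dict[str, int], after: Dict[str, int]) -> List[Tuple[str, float]]:
    """Rank unique tags by fold change in relative abundance after compound exposure.

    Each tag stands for one strain, and so for one overexpressed gene product.
    A fold change above 1.0 means the strain proliferated more rapidly than
    the culture as a whole.
    """
    total_before = sum(before.values())
    total_after = sum(after.values())
    if total_before == 0 or total_after == 0:
        raise ValueError("both samples need at least one counted tag")
    folds = []
    for tag, n_before in before.items():
        if n_before == 0:
            continue  # fold change is undefined without a starting count
        freq_before = n_before / total_before
        freq_after = after.get(tag, 0) / total_after
        folds.append((tag, freq_after / freq_before))
    # Most enriched tag first.
    return sorted(folds, key=lambda item: item[1], reverse=True)
```

The tag at the head of the returned list would point to the strain, and hence the overexpressed gene product, that best withstood the compound.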

457. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain
overexpresses a different gene product which is essential for proliferation of said organism
and wherein the nucleotide sequence of each of the overexpressed genes has been altered so 
as to include a nucleotide sequence which can be used to generate a unique product 
corresponding to each of the overexpressed genes, wherein said culture comprises a strain
in which a gene product encoded by a nucleic acid comprising a nucleotide sequence
selected from the group consisting of SEQ ID NOs.: 6214-42397 is overexpressed;

contacting said culture with a sufficient concentration of said compound to inhibit 
the proliferation of strains of said organism which do not overexpress said gene product on 
which said compound acts, such that strains which overexpress said gene product on which
said compound acts proliferate more rapidly than strains which do not overexpress said
gene product on which said compound acts; and

identifying the gene product which is overexpressed in a strain which proliferated 
more rapidly in said culture by detecting the unique product corresponding to said gene. 

458. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising:

obtaining a culture comprising a plurality of strains wherein each strain 
overexpresses a different gene product which is essential for proliferation of said organism 
and wherein the nucleotide sequence of each of the overexpressed genes has been altered so 
as to include a nucleotide sequence which can be used to generate a unique product 
corresponding to each of the overexpressed genes, wherein said culture comprises a strain
in which a gene product comprising an amino acid sequence selected from the group
consisting of SEQ ID NOs.: 42938-78581 is overexpressed;

contacting said culture with a sufficient concentration of said compound to inhibit
the proliferation of strains of said organism which do not overexpress said gene product on 
which said compound acts, such that strains which overexpress said gene product on which 
said compound acts proliferate more rapidly than strains which do not overexpress said 
gene product on which said compound acts; and

identifying the gene product which is overexpressed in a strain which proliferated 
more rapidly in said culture by detecting the unique product corresponding to said gene. 

459. A method for identifying the gene product on which a compound which inhibits
proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain

overexpresses a different gene product which is essential for proliferation of said organism 
and wherein the nucleotide sequence of each of the overexpressed genes has been altered so 
as to include a nucleotide sequence which can be used to generate a unique product 
corresponding to each of the overexpressed genes, wherein said culture comprises a strain 

in which a gene product selected from the group consisting of a gene product having at

least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the 
default parameters to a gene product whose expression is inhibited by an antisense nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 
1-6213, a gene product encoded by a nucleic acid having at least 70% nucleotide sequence 

identity as determined using BLASTN version 2.0 with the default parameters to a nucleic

acid encoding a gene product whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1- 
6213, a gene product having at least 25% amino acid identity as determined using FASTA 
version 3.0t78 with the default parameters to a gene product whose expression is inhibited 

by an antisense nucleic acid comprising a nucleotide sequence selected from the group

consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid which
hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by 
a nucleic acid which hybridizes to a nucleic acid comprising a nucleotide sequence selected 

from the group consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene

product whose activity may be complemented by the gene product whose activity is 
inhibited by a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs: 1-6213 is overexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit 

the proliferation of strains of said organism which do not overexpress said gene product on

which said compound acts, such that strains which overexpress said gene product on which
said compound acts proliferate more rapidly than strains which do not overexpress said 
gene product on which said compound acts; and

identifying the gene product which is overexpressed in a strain which proliferated
more rapidly in said culture by detecting the unique product corresponding to said gene. 

460. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain

overexpresses a different gene product which is essential for proliferation of said organism 
and wherein the nucleotide sequence of each of the overexpressed genes has been altered so 
as to include a nucleotide sequence which can be used to generate a unique product 
corresponding to each of the overexpressed genes, wherein said culture comprises a strain 

in which a gene product encoded by a nucleic acid comprising a nucleotide sequence

selected from the group consisting of a nucleic acid comprising a nucleic acid having at 
least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the 
default parameters to a nucleotide sequence selected from the group consisting of SEQ ID 
NOS.: 6214-42397, a nucleic acid comprising a nucleotide sequence which hybridizes to a 

sequence selected from the group consisting of SEQ ID NOS.: 6214-42397 under stringent

conditions, and a nucleic acid comprising a nucleotide sequence which hybridizes to a 
nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214-42397 
under moderate conditions is overexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit 

the proliferation of strains of said organism which do not overexpress said gene product on

which said compound acts, such that strains which overexpress said gene product on which 
said compound acts proliferate more rapidly than strains which do not overexpress said 
gene product on which said compound acts; and 

identifying the gene product which is overexpressed in a strain which proliferated 

more rapidly in said culture by detecting the unique product corresponding to said gene.
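
Purely as an illustration of the arithmetic behind a percentage identity threshold such as the 70% nucleotide identity recited above, the following sketch computes percent identity over two sequences that have already been aligned. It is not BLASTN and does not reproduce BLASTN's alignment or scoring; the function name, the gap symbol and the example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns in which both sequences carry the same residue.

    Both inputs must be the same length (already aligned). Columns in which
    both sequences have a gap are ignored; a gap opposite a residue counts
    as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("alignment has no comparable columns")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


# Hypothetical aligned sequences: 9 of 10 columns match.
identity = percent_identity("ACGTACGTAC", "ACGTTCGTAC")
meets_threshold = identity >= 70.0
```

A candidate sequence would then be compared against the recited threshold using whatever identity value the stated alignment program reports; the sketch only shows how such a value relates to matched columns.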

461. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain 
overexpresses a different gene product which is essential for proliferation of said organism 

and wherein the nucleotide sequence of each of the overexpressed genes has been altered so

as to include a nucleotide sequence which can be used to generate a unique product 
corresponding to each of the overexpressed genes, wherein said culture comprises a strain 
in which a gene product comprises a polypeptide selected from the group consisting of a 
polypeptide having at least 25% amino acid identity as determined using FASTA version 

3.0t78 to a polypeptide selected from the group consisting of SEQ ID NOs.: 42938-78581

and a polypeptide whose activity may be complemented by a polypeptide selected from the 
group consisting of SEQ ID NOs: 42938-78581 is overexpressed;

contacting said culture with a sufficient concentration of said compound to inhibit
the proliferation of strains of said organism which do not overexpress said gene product on 
which said compound acts, such that strains which overexpress said gene product on which 
said compound acts proliferate more rapidly than strains which do not overexpress said 
gene product on which said compound acts; and

identifying the gene product which is overexpressed in a strain which proliferated 
more rapidly in said culture by detecting the unique product corresponding to said gene. 

462. The method of Paragraph 456, 457, 458, 459, 460 or 461, wherein the nucleotide 
sequence of each of the genes encoding an overexpressed gene product has been altered by 

replacing the native promoters of said genes with promoters which facilitate overexpression of said
gene products. 

463. The method of Paragraph 456, 457, 458, 459, 460 or 461, wherein the nucleotide 
sequence of each of the genes encoding an overexpressed gene product has been altered by inserting 
a regulatory element into the native promoters of said genes with a promoter which facilitates 

overexpression of said gene products.

464. The method of Paragraph 463, wherein said regulatory element is selected from the 
group consisting of a regulatable promoter, an operator which is recognized by a repressor, a 
nucleotide sequence which is recognized by a transcriptional activator, a transcriptional terminator, 
a nucleotide sequence which introduces a bend in the DNA and an upstream activating sequence. 

465. The method of Paragraph 456, 457, 458, 459, 460 or 461, wherein the step of

identifying the gene product which is overexpressed in a strain which proliferated more rapidly in 
said culture by detecting the unique product corresponding to said gene comprises performing an 
amplification reaction and detecting a unique amplification product corresponding to said gene. 

466. The method of Paragraph 462, wherein the native promoter of each of the genes 
encoding a gene product essential for proliferation is replaced with the same promoter.

467. The method of Paragraph 462, wherein the native promoters of the genes encoding 
gene products essential for proliferation are replaced with a plurality of promoters selected to give a 
desired expression level for each gene product. 

468. The method of Paragraph 462, wherein said promoters which replaced the native 
promoters in each strain comprise regulatable promoters.

469. The method of Paragraph 462, wherein said promoters which replaced the native 
promoters in each strain comprise constitutive promoters.

470. The method of Paragraph 456, 457, 458, 459, 460 or 461, wherein said organism is 
selected from the group consisting of bacteria, fungi, and protozoa. 

471. The method of Paragraph 456, 457, 458, 459, 460 or 461, wherein said culture is a

culture of an organism selected from the group consisting of Acinetobacter baumannii, Anaplasma 
marginale, Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis,
Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei,
Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata),
Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr
(also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia
trachomatis, Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium 

perfringens, Coccidioides immitis, Corynebacterium diphtheriae, Cryptococcus neoformans,
Enterobacter cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, 
Haemophilus influenzae, Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae, 
Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, 
Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma 

genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia
asteroides, Pasteurella haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus 
mirabilis, Proteus vulgaris, Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas 
syringae, Salmonella bongori, Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, 
Salmonella typhi, Salmonella typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, 

Shigella sonnei, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus,
Streptococcus pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, 
Ureaplasma urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia 
enterocolitica, Yersinia pestis and any species falling within the genera of any of the above species.

472. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising:

obtaining a culture comprising a plurality of strains wherein each strain 
underexpresses a different gene product which is essential for proliferation of said organism 
and wherein the nucleotide sequence of each of the underexpressed genes has been altered 
so as to include a nucleotide sequence which can be used to generate a unique product 

corresponding to each of the underexpressed genes and wherein said culture comprises a

strain in which a gene product whose activity or level is inhibited by a nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1- 
6213 is underexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit 

the proliferation of strains of said organism which underexpress said gene product on which

said compound acts, such that strains which underexpress said gene product on which said 
compound acts proliferate more slowly than strains which do not underexpress the gene 
product on which said compound acts; and 

identifying the gene product which is underexpressed in a strain which proliferated 

more rapidly in said culture by detecting the unique product corresponding to said gene.

473. A method for identifying the gene product on which a compound which inhibits
proliferation of an organism acts comprising:

obtaining a culture comprising a plurality of strains wherein each strain
underexpresses a different gene product which is essential for proliferation of said organism 
and wherein the nucleotide sequence of each of the underexpressed genes has been altered 
so as to include a nucleotide sequence which can be used to generate a unique product 
corresponding to each of the underexpressed genes and wherein said culture comprises a

strain in which a gene product encoded by a nucleic acid comprising a nucleotide sequence 
selected from the group consisting of SEQ ID NOs.: 6214-42397 is underexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit 
the proliferation of strains of said organism which underexpress said gene product on which 
said compound acts, such that strains which underexpress said gene product on which said

compound acts proliferate more slowly than strains which do not underexpress the gene 
product on which said compound acts; and 

identifying the gene product which is underexpressed in a strain which proliferated 
more rapidly in said culture by detecting the unique product corresponding to said gene. 

474. A method for identifying the gene product on which a compound which inhibits

proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain 
underexpresses a different gene product which is essential for proliferation of said organism 
and wherein the nucleotide sequence of each of the underexpressed genes has been altered 
so as to include a nucleotide sequence which can be used to generate a unique product

corresponding to each of the underexpressed genes, wherein said culture comprises a strain 
in which a gene product comprising an amino acid sequence selected from the group 
consisting of SEQ ID NOs.: 42938-78581 is underexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit 
the proliferation of strains of said organism which underexpress said gene product on which

said compound acts, such that strains which underexpress said gene product on which said 
compound acts proliferate more slowly than strains which do not underexpress the gene 
product on which said compound acts; and 

identifying the gene product which is underexpressed in a strain which proliferated 
more rapidly in said culture by detecting the unique product corresponding to said gene.

475. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain 
underexpresses a different gene product which is essential for proliferation of said organism 
and wherein the nucleotide sequence of each of the underexpressed genes has been altered

so as to include a nucleotide sequence which can be used to generate a unique product 
corresponding to each of the underexpressed genes, wherein said culture comprises a strain 
in which a gene product selected from the group consisting of a gene product having at
least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the
default parameters to a gene product whose expression is inhibited by an antisense nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 
1-6213, a gene product encoded by a nucleic acid having at least 70% nucleotide sequence 

identity as determined using BLASTN version 2.0 with the default parameters to a nucleic

acid encoding a gene product whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1- 
6213, a gene product having at least 25% amino acid identity as determined using FASTA 
version 3.0t78 with the default parameters to a gene product whose expression is inhibited 

by an antisense nucleic acid comprising a nucleotide sequence selected from the group

consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid which 
hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by
a nucleic acid which hybridizes to a nucleic acid comprising a nucleotide sequence selected 

from the group consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene

product whose activity may be complemented by the gene product whose activity is 
inhibited by a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs: 1-6213 is underexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit 

the proliferation of strains of said organism which underexpress said gene product on which

said compound acts, such that strains which underexpress said gene product on which said 
compound acts proliferate more slowly than strains which do not underexpress the gene 
product on which said compound acts; and 

identifying the gene product which is underexpressed in a strain which proliferated 

more rapidly in said culture by detecting the unique product corresponding to said gene.

476. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain 
underexpresses a different gene product which is essential for proliferation of said organism 

and wherein the nucleotide sequence of each of the underexpressed genes has been altered

so as to include a nucleotide sequence which can be used to generate a unique product 
corresponding to each of the underexpressed genes, wherein said culture comprises a strain 
in which a gene product encoded by a nucleic acid comprising a nucleotide sequence 
selected from the group consisting of a nucleic acid comprising a nucleic acid having at 

least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the

default parameters to a nucleotide sequence selected from the group consisting of SEQ ID
NOS.: 6214-42397, a nucleic acid comprising a nucleotide sequence which hybridizes to a 
sequence selected from the group consisting of SEQ ID NOS.: 6214-42397 under stringent
conditions, and a nucleic acid comprising a nucleotide sequence which hybridizes to a
nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214-42397 
under moderate conditions is underexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit 
the proliferation of strains of said organism which underexpress said gene product on which

said compound acts, such that strains which underexpress said gene product on which said 
compound acts proliferate more slowly than strains which do not underexpress the gene 
product on which said compound acts; and 

identifying the gene product which is underexpressed in a strain which proliferated
more rapidly in said culture by detecting the unique product corresponding to said gene.

477. A method for identifying the gene product on which a compound which inhibits 
proliferation of an organism acts comprising: 

obtaining a culture comprising a plurality of strains wherein each strain 
underexpresses a different gene product which is essential for proliferation of said organism 
and wherein the nucleotide sequence of each of the underexpressed genes has been altered

so as to include a nucleotide sequence which can be used to generate a unique product 
corresponding to each of the underexpressed genes, wherein said culture comprises a strain
in which a gene product comprises a polypeptide selected from the group consisting of a 
polypeptide having at least 25% amino acid identity as determined using FASTA version 
3.0t78 to a polypeptide selected from the group consisting of SEQ ID NOs.: 42938-78581

and a polypeptide whose activity may be complemented by a polypeptide selected from the 
group consisting of SEQ ID NOs: 42938-78581 is underexpressed; 

contacting said culture with a sufficient concentration of said compound to inhibit 
the proliferation of strains of said organism which underexpress said gene product on which 
said compound acts, such that strains which underexpress said gene product on which said

compound acts proliferate more slowly than strains which do not underexpress the gene 
product on which said compound acts; and 

identifying the gene product which is underexpressed in a strain which proliferated 
more rapidly in said culture by detecting the unique product corresponding to said gene. 

478. The method of Paragraph 472, 473, 474, 475, 476 or 477, wherein the nucleotide

sequence of each of the genes encoding an underexpressed gene product has been altered by 
replacing the native promoters of said genes with promoters which facilitate underexpression of 
said gene products. 

479. The method of Paragraph 472, 473, 474, 475, 476 or 477, wherein the nucleotide 
sequence of each of the genes encoding an underexpressed gene product has been altered by
inserting a regulatory element into the native promoters of said genes with a promoter which 
facilitates underexpression of said gene products.

480. The method of Paragraph 479, wherein said regulatory element is selected from the
group consisting of a regulatable promoter, an operator which is recognized by a repressor, a 
nucleotide sequence which is recognized by a transcriptional activator, a transcriptional terminator, 
a nucleotide sequence which introduces a bend in the DNA and an upstream activating sequence. 

481. The method of Paragraph 472, 473, 474, 475, 476 or 477, wherein the step of

identifying the gene product which is underexpressed in a strain which proliferated more slowly in 
said culture by detecting the unique product corresponding to said gene comprises performing an 
amplification reaction and detecting a unique amplification product corresponding to said gene. 

482. The method of Paragraph 478, wherein the native promoter of each of the genes 
encoding a gene product essential for proliferation is replaced with the same promoter.

483. The method of Paragraph 478, wherein the native promoters of the genes encoding 
gene products essential for proliferation are replaced with a plurality of promoters selected to give a 
desired expression level for each gene product. 

484. The method of Paragraph 478, wherein said promoters which replaced the native 
promoters in each strain comprise regulatable promoters.

485. The method of Paragraph 478, wherein said promoters which replaced the native 
promoters in each strain comprise constitutive promoters.

486. The method of Paragraph 472, 473, 474, 475, 476 or 477, wherein said organism is 
selected from the group consisting of bacteria, fungi, and protozoa. 

487. The method of Paragraph 472, 473, 474, 475, 476 or 477, wherein said culture is a

culture of an organism selected from the group consisting of Acinetobacter baumannii, Anaplasma 
marginale, Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, 
Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei,
Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), 

Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr
(also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia
trachomatis, Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium 
perfringens, Coccidioides immitis, Corynebacterium diphtheriae, Cryptococcus neoformans,
Enterobacter cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, 

Haemophilus influenzae, Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae,
Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, 
Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma 
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia 
asteroides, Pasteurella haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus 

mirabilis, Proteus vulgaris, Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas
syringae, Salmonella bongori, Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, 
Salmonella typhi, Salmonella typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, 
Shigella sonnei, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus,
Streptococcus pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum,
Ureaplasma urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia 
enterocolitica, Yersinia pestis and any species falling within the genera of any of the above species. 

488. A method for determining the extent to which each of a plurality of strains are 
present in a culture or collection of strains comprising:

obtaining a nucleic acid sample comprising nucleic acids from a culture or 
collection of strains wherein said culture or collection of strains comprises a plurality of 
strains wherein each strain overexpresses or underexpresses a different gene product which 
is required for proliferation of said organism wherein said culture comprises a strain in 

which a gene product whose activity or level is inhibited by a nucleic acid comprising a

nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213 is 
overexpressed or underexpressed; 

performing an amplification reaction using a set of primer pairs which are 
complementary to nucleotide sequences within or adjacent to the genes which encode said 

gene products, wherein the members of said set of primer pairs are designed such that each

primer pair would yield an amplification product having a length distinguishable from the 
lengths of the amplification products from the other primer pairs if a strain comprising the 
nucleotide sequences complementary to said primer pair is present in said culture or 
collection of strains; and 

determining the lengths of the amplification products obtained in said amplification

reaction. 
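
As a non-limiting illustration of how amplification products of distinguishable length may be related back to individual strains, the following sketch assigns measured product lengths to strains. The strain names, expected lengths, measured peaks, sizing tolerance and function names are all hypothetical assumptions of the sketch and are not taken from this application.

```python
def check_distinguishable(expected_lengths, tolerance):
    """Raise ValueError if two expected product lengths could be confused.

    Two lengths are treated as confusable when they differ by no more than
    twice the sizing tolerance (an assumption of this sketch).
    """
    ordered = sorted(expected_lengths.items(), key=lambda item: item[1])
    for (name_a, len_a), (name_b, len_b) in zip(ordered, ordered[1:]):
        if len_b - len_a <= 2 * tolerance:
            raise ValueError(f"{name_a} and {name_b} are not distinguishable")


def strain_signal(expected_lengths, peaks, tolerance=2):
    """Sum the intensity of measured peaks per strain.

    `peaks` is a list of (length, intensity) pairs. Peaks matching no
    expected length are ignored; strains with no matching peak get 0.0.
    """
    check_distinguishable(expected_lengths, tolerance)
    signal = {name: 0.0 for name in expected_lengths}
    for length, intensity in peaks:
        for name, expected in expected_lengths.items():
            if abs(length - expected) <= tolerance:
                signal[name] += intensity
                break
    return signal


# Hypothetical design: one primer pair per strain, one product length each.
expected = {"strain_1": 150, "strain_2": 200, "strain_3": 250}
# Hypothetical measured products as (length in base pairs, intensity).
peaks = [(151, 800.0), (199, 50.0), (320, 10.0)]

signal = strain_signal(expected, peaks)
```

The per-strain signal gives a rough indication of the extent to which each strain is present; a strain whose expected product is absent from the measured lengths is reported with zero signal.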

489. A method for determining the extent to which each of a plurality of strains are 
present in a culture or collection of strains comprising: 

obtaining a nucleic acid sample comprising nucleic acids from a culture or 
collection of strains wherein said culture or collection of strains comprises a plurality of

strains wherein each strain overexpresses or underexpresses a different gene product which 
is required for proliferation of said organism, wherein said culture comprises a strain in
which a gene product encoded by a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of SEQ ID NOs.: 6214-42397 is overexpressed or 
underexpressed;

performing an amplification reaction using a set of primer pairs which are 
complementary to nucleotide sequences within or adjacent to the genes which encode said 
gene products, wherein the members of said set of primer pairs are designed such that each 
primer pair would yield an amplification product having a length distinguishable from the 
lengths of the amplification products from the other primer pairs if a strain comprising the

nucleotide sequences complementary to said primer pair is present in said culture or 
collection of strains; and

determining the lengths of the amplification products obtained in said amplification
reaction. 

490. A method for determining the extent to which each of a plurality of strains are 
present in a culture or collection of strains comprising: 

obtaining a nucleic acid sample comprising nucleic acids from a culture or

collection of strains wherein said culture or collection of strains comprises a plurality of 
strains wherein each strain overexpresses or underexpresses a different gene product which 
is required for proliferation of said organism, wherein said culture comprises a strain in 
which a gene product comprising an amino acid sequence selected from the group 

consisting of SEQ ID NOs.: 42938-78581 is overexpressed or underexpressed;

performing an amplification reaction using a set of primer pairs which are 
complementary to nucleotide sequences within or adjacent to the genes which encode said 
gene products, wherein the members of said set of primer pairs are designed such that each 
primer pair would yield an amplification product having a length distinguishable from the 

lengths of the amplification products from the other primer pairs if a strain comprising the

nucleotide sequences complementary to said primer pair is present in said culture or 
collection of strains; and 

determining the lengths of the amplification products obtained in said amplification 
reaction. 

491. A method for determining the extent to which each of a plurality of strains are

present in a culture or collection of strains comprising: 

obtaining a nucleic acid sample comprising nucleic acids from a culture or 
collection of strains wherein said culture or collection of strains comprises a plurality of 
strains wherein each strain overexpresses or underexpresses a different gene product which 

is required for proliferation of said organism, wherein said culture comprises a strain in

which a gene product selected from the group consisting of a gene product having at least 
70% nucleotide sequence identity as determined using BLASTN version 2.0 with the 
default parameters to a gene product whose expression is inhibited by an antisense nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 

1-6213, a gene product encoded by a nucleic acid having at least 70% nucleotide sequence

identity as determined using BLASTN version 2.0 with the default parameters to a nucleic 
acid encoding a gene product whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1- 
6213, a gene product having at least 25% amino acid identity as determined using FASTA 

version 3.0t78 with the default parameters to a gene product whose expression is inhibited

by an antisense nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid which 
hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group 


consisting of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by 
a nucleic acid which hybridizes to a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene 
product whose activity may be complemented by the gene product whose activity is 

inhibited by a nucleic acid comprising a nucleotide sequence selected from the group

consisting of SEQ ID NOs: 1-6213 is overexpressed or underexpressed; 

performing an amplification reaction using a set of primer pairs which are 
complementary to nucleotide sequences within or adjacent to the genes which encode said 
gene products, wherein the members of said set of primer pairs are designed such that each 

primer pair would yield an amplification product having a length distinguishable from the

lengths of the amplification products from the other primer pairs if a strain comprising the 
nucleotide sequences complementary to said primer pair is present in said culture or 
collection of strains; and 

determining the lengths of the amplification products obtained in said amplification 

reaction.

492. A method for determining the extent to which each of a plurality of strains are 
present in a culture or collection of strains comprising: 

obtaining a nucleic acid sample comprising nucleic acids from a culture or 
collection of strains wherein said culture or collection of strains comprises a plurality of 

strains wherein each strain overexpresses or underexpresses a different gene product which
is required for proliferation of said organism, wherein said culture comprises a strain in
which a gene product encoded by a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of a nucleic acid comprising a nucleic acid having at least 70% 
nucleotide sequence identity as determined using BLASTN version 2.0 with the default 

parameters to a nucleotide sequence selected from the group consisting of SEQ ID NOS.:

6214-42397, a nucleic acid comprising a nucleotide sequence which hybridizes to a 
sequence selected from the group consisting of SEQ ID NOS.: 6214-42397 under stringent 
conditions, and a nucleic acid comprising a nucleotide sequence which hybridizes to a 
nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214-42397 

under moderate conditions is overexpressed or underexpressed;

performing an amplification reaction using a set of primer pairs which are 
complementary to nucleotide sequences within or adjacent to the genes which encode said 
gene products, wherein the members of said set of primer pairs are designed such that each 
primer pair would yield an amplification product having a length distinguishable from the 

lengths of the amplification products from the other primer pairs if a strain comprising the

nucleotide sequences complementary to said primer pair is present in said culture or 
collection of strains; and 

determining the lengths of the amplification products obtained in said amplification
reaction. 

493. A method for determining the extent to which each of a plurality of strains are 
present in a culture or collection of strains comprising: 

obtaining a nucleic acid sample comprising nucleic acids from a culture or

collection of strains wherein said culture or collection of strains comprises a plurality of 
strains wherein each strain overexpresses or underexpresses a different gene product which 
is required for proliferation of said organism, wherein said culture comprises a strain in
which a gene product comprising a polypeptide selected from the group consisting of a 

polypeptide having at least 25% amino acid identity as determined using FASTA version

3.0t78 to a polypeptide selected from the group consisting of SEQ ID NOs.: 42938-78581 
and a polypeptide whose activity may be complemented by a polypeptide selected from the 
group consisting of SEQ ID NOs: 42938-78581 is overexpressed or underexpressed; 

performing an amplification reaction using a set of primer pairs which are 

complementary to nucleotide sequences within or adjacent to the genes which encode said

gene products, wherein the members of said set of primer pairs are designed such that each 
primer pair would yield an amplification product having a length distinguishable from the 
lengths of the amplification products from the other primer pairs if a strain comprising the 
nucleotide sequences complementary to said primer pair is present in said culture or
collection of strains; and

determining the lengths of the amplification products obtained in said amplification 
reaction. 
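
Illustrative note (not part of the numbered paragraphs): the length-based readout recited in Paragraphs 491-493 can be sketched as follows. This is a minimal sketch under stated assumptions, not the applicants' implementation. The function name, the strain names, the tolerance value, and the use of peak area as a proxy for product amount are all hypothetical choices made for illustration.

```python
from typing import Dict, List, Tuple


def strain_signal(
    expected_lengths: Dict[str, int],
    peaks: List[Tuple[float, float]],
    tolerance: float = 1.0,
) -> Dict[str, float]:
    """Sum the peak area observed near each strain's expected product length.

    expected_lengths: hypothetical map of strain name -> designed amplicon length (bp).
    peaks: observed (length, area) pairs from a sizing run (assumed input format).
    tolerance: assumed sizing error in bp; a peak within this window is assigned.
    """
    signal = {strain: 0.0 for strain in expected_lengths}
    for length, area in peaks:
        for strain, expected in expected_lengths.items():
            if abs(length - expected) <= tolerance:
                signal[strain] += area
    return signal


def lengths_distinguishable(expected_lengths: Dict[str, int], min_gap: int = 3) -> bool:
    """Check that every pair of designed product lengths differs by at least min_gap bp."""
    lengths = sorted(expected_lengths.values())
    return all(b - a >= min_gap for a, b in zip(lengths, lengths[1:]))


if __name__ == "__main__":
    design = {"strainA": 150, "strainB": 175, "strainC": 200}
    observed = [(150.3, 1200.0), (199.8, 300.0)]
    print(lengths_distinguishable(design))
    print(strain_signal(design, observed))
```

A strain whose expected length has no matching peak receives a signal of zero, which in this sketch is read as the strain being absent or below detection.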

494. The method of Paragraph 488, 489, 490, 491, 492 or 493, wherein one member of 
each primer pair for each of said genes is labeled with a detectable dye. 

495. The method of Paragraph 488, 489, 490, 491, 492 or 493, wherein:

said nucleic acid sample is divided into N aliquots; and 

said amplification reaction is performed on each aliquot using primer pairs 
complementary to nucleotide sequences within or adjacent to 1/N of the genes which 
encode said gene products, wherein one of the members of each primer pair in each aliquot 
is labeled with a dye and wherein the dyes on the primers in each aliquot are

distinguishable from one another. 
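
Illustrative note (not part of the numbered paragraphs): one way to read Paragraph 495 is that the genes are split evenly across N aliquots and each aliquot uses its own dye, so that products remain identifiable by dye plus length. The sketch below assumes that reading. The function name, gene names, and dye labels are hypothetical placeholders, and the round-robin split is only one possible way to assign 1/N of the genes to each aliquot.

```python
from typing import Dict, List


def plan_aliquots(genes: List[str], dyes: List[str]) -> Dict[str, List[str]]:
    """Assign genes round-robin to N aliquots, one distinguishable dye per aliquot.

    genes: hypothetical gene identifiers whose primer pairs are to be split up.
    dyes: one label per aliquot; N is taken to be len(dyes).
    """
    if not dyes:
        raise ValueError("at least one dye (aliquot) is required")
    if len(set(dyes)) != len(dyes):
        raise ValueError("dyes must be distinguishable from one another")
    plan: Dict[str, List[str]] = {dye: [] for dye in dyes}
    for index, gene in enumerate(genes):
        plan[dyes[index % len(dyes)]].append(gene)
    return plan


if __name__ == "__main__":
    print(plan_aliquots(["g1", "g2", "g3", "g4", "g5", "g6"], ["dye_a", "dye_b", "dye_c"]))
```

Because each aliquot carries a different dye, two products of similar length in different aliquots would still be separable by dye in this scheme.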

496. The method of Paragraph 494, further comprising pooling the amplification 
products from each of the aliquots prior to determining the lengths of the amplification products. 

497. The method of Paragraph 488, 489, 490, 491, 492 or 493, wherein the native 
promoters of said genes which encode said gene products have been replaced with a regulatable
promoter and one of the primers in said primer pairs is complementary to a nucleotide sequence
within said regulatable promoter. 

498. The method of Paragraph 496, wherein the native promoters for each of said genes
were replaced with the same regulatable promoter. 

499. The method of Paragraph 496, wherein more than one regulatable promoter was 
used to replace the promoters of said genes such that some of said genes are under the control of a 

different regulatable promoter.

500. A method for identifying the target of a compound which inhibits the proliferation 
of an organism comprising: 

obtaining a first nucleic acid sample comprising nucleic acids from a first culture or 
collection of strains wherein said culture or collection of strains comprises a plurality of 

strains wherein each strain overexpresses or underexpresses a different gene product which

is required for proliferation of said organism and wherein said culture or collection of 
strains has been contacted with said compound; 

obtaining a second nucleic acid sample comprising nucleic acids from a second 
culture or collection of strains wherein said culture or collection of strains comprises the 

same strains as said first culture or collection of strains wherein said second culture or

collection of strains has not been contacted with said compound; 

performing a first amplification reaction on said first nucleic acid sample using a 
set of primer pairs which are complementary to nucleotide sequences within or adjacent to 
the genes which encode said gene products, wherein the members of said set of primer pairs 

are designed such that each primer pair would yield an amplification product having a

length distinguishable from the lengths of the amplification products from the other primer 
pairs if a strain comprising the nucleotide sequences complementary to said primer pair is 
present in said culture or collection of strains; 

performing a second amplification reaction on said second nucleic acid sample 

using the same set of primer pairs used in said first amplification reaction;

and comparing the amount of each amplification product in said first amplification 
reaction to the amount of that amplification product in said second amplification reaction, 
wherein an increased level of an amplification product in said first amplification reaction 
relative to said second amplification reaction indicates that the gene product corresponding 

to said amplification product is the target of said compound if said culture or strain
overexpresses said gene products and a decreased level of an amplification product in
said first amplification reaction relative to said second amplification reaction indicates that 
the gene product corresponding to said amplification product is the target of said compound 
if said culture or strain overexpresses said gene products, wherein said first and second 

cultures or collection of strains comprise a strain in which a gene product whose activity or
level is inhibited by a nucleic acid comprising a nucleotide sequence selected from the
group consisting of SEQ ID NOs.: 1-6213 is overexpressed or underexpressed.
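
Illustrative note (not part of the numbered paragraphs): the comparison step of Paragraph 500 can be sketched as a per-product ratio between the compound-treated and untreated reactions. This is a sketch under stated assumptions only. The function name, the two-fold threshold, the gene names, and the use of a simple ratio rather than any particular normalization are hypothetical and are not taken from the specification.

```python
from typing import Dict


def compare_products(
    treated: Dict[str, float],
    untreated: Dict[str, float],
    fold_threshold: float = 2.0,
) -> Dict[str, str]:
    """Label each amplification product as increased, decreased, or unchanged.

    treated / untreated: hypothetical maps of product name -> measured amount
    (for example peak area) in the first and second amplification reactions.
    fold_threshold: assumed cutoff; a ratio at or above it is called increased,
    a ratio at or below its reciprocal is called decreased.
    """
    calls: Dict[str, str] = {}
    for product, control_amount in untreated.items():
        if control_amount <= 0:
            # No usable reference signal, so no ratio can be formed.
            calls[product] = "no reference signal"
            continue
        ratio = treated.get(product, 0.0) / control_amount
        if ratio >= fold_threshold:
            calls[product] = "increased"
        elif ratio <= 1.0 / fold_threshold:
            calls[product] = "decreased"
        else:
            calls[product] = "unchanged"
    return calls


if __name__ == "__main__":
    first = {"geneA": 400.0, "geneB": 100.0, "geneC": 20.0}
    second = {"geneA": 100.0, "geneB": 100.0, "geneC": 100.0}
    print(compare_products(first, second))
```

Products called increased or decreased are the candidates that the paragraph associates with the target of the compound; how such a call is interpreted depends on whether the corresponding strain overexpresses or underexpresses the gene product.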

501. A method for identifying the target of a compound which inhibits the proliferation
of an organism comprising: 

obtaining a first nucleic acid sample comprising nucleic acids from a first culture or 
collection of strains wherein said culture or collection of strains comprises a plurality of 
strains wherein each strain overexpresses or underexpresses a different gene product which

is required for proliferation of said organism and wherein said culture or collection of 
strains has been contacted with said compound; 

obtaining a second nucleic acid sample comprising nucleic acids from a second 
culture or collection of strains wherein said culture or collection of strains comprises the 
same strains as said first culture or collection of strains wherein said second culture or
collection of strains has not been contacted with said compound;

performing a first amplification reaction on said first nucleic acid sample using a 
set of primer pairs which are complementary to nucleotide sequences within or adjacent to 
the genes which encode said gene products, wherein the members of said set of primer pairs 
are designed such that each primer pair would yield an amplification product having a

length distinguishable from the lengths of the amplification products from the other primer 
pairs if a strain comprising the nucleotide sequences complementary to said primer pair is 
present in said culture or collection of strains; 

performing a second amplification reaction on said second nucleic acid sample 
using the same set of primer pairs used in said first amplification reaction;

and comparing the amount of each amplification product in said first amplification 
reaction to the amount of that amplification product in said second amplification reaction, 
wherein an increased level of an amplification product in said first amplification reaction 
relative to said second amplification reaction indicates that the gene product corresponding 
to said amplification product is the target of said compound if said culture or strain
overexpresses said gene products and a decreased level of an amplification product in
said first amplification reaction relative to said second amplification reaction indicates that 
the gene product corresponding to said amplification product is the target of said compound 
if said culture or strain overexpresses said gene products, wherein said first and second 
cultures or collection of strains comprise a strain in which a gene product encoded by a

nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ 
ID NOs.: 6214-42397 is overexpressed or underexpressed. 

502. A method for identifying the target of a compound which inhibits the proliferation 
of an organism comprising: 
obtaining a first nucleic acid sample comprising nucleic acids from a first culture or

collection of strains wherein said culture or collection of strains comprises a plurality of 
strains wherein each strain overexpresses or underexpresses a different gene product which 




is required for proliferation of said organism and wherein said culture or collection of 
strains has been contacted with said compound; 

obtaining a second nucleic acid sample comprising nucleic acids from a second 
culture or collection of strains wherein said culture or collection of strains comprises the 
same strains as said first culture or collection of strains wherein said second culture or

collection of strains has not been contacted with said compound; 

performing a first amplification reaction on said first nucleic acid sample using a 
set of primer pairs which are complementary to nucleotide sequences within or adjacent to 
the genes which encode said gene products, wherein the members of said set of primer pairs 
are designed such that each primer pair would yield an amplification product having a

length distinguishable from the lengths of the amplification products from the other primer 
pairs if a strain comprising the nucleotide sequences complementary to said primer pair is 
present in said culture or collection of strains; 

performing a second amplification reaction on said second nucleic acid sample 
using the same set of primer pairs used in said first amplification reaction;

and comparing the amount of each amplification product in said first amplification 
reaction to the amount of that amplification product in said second amplification reaction, 
wherein an increased level of an amplification product in said first amplification reaction 
relative to said second amplification reaction indicates that the gene product corresponding 
to said amplification product is the target of said compound if said culture or strain
overexpresses said gene products and a decreased level of an amplification product in
said first amplification reaction relative to said second amplification reaction indicates that 
the gene product corresponding to said amplification product is the target of said compound 
if said culture or strain overexpresses said gene products, wherein said first and second 
cultures or collection of strains comprise a strain in which a gene product comprising an
amino acid sequence selected from the group consisting of SEQ ID NOs.: 42938-78581 is
overexpressed or underexpressed. 

503. A method for identifying the target of a compound which inhibits the proliferation 
of an organism comprising: 

obtaining a first nucleic acid sample comprising nucleic acids from a first culture or

collection of strains wherein said culture or collection of strains comprises a plurality of 
strains wherein each strain overexpresses or underexpresses a different gene product which 
is required for proliferation of said organism and wherein said culture or collection of 
strains has been contacted with said compound; 

obtaining a second nucleic acid sample comprising nucleic acids from a second

culture or collection of strains wherein said culture or collection of strains comprises the 
same strains as said first culture or collection of strains wherein said second culture or 
collection of strains has not been contacted with said compound; 

performing a first amplification reaction on said first nucleic acid sample using a
set of primer pairs which are complementary to nucleotide sequences within or adjacent to 
the genes which encode said gene products, wherein the members of said set of primer pairs 
are designed such that each primer pair would yield an amplification product having a 
length distinguishable from the lengths of the amplification products from the other primer

pairs if a strain comprising the nucleotide sequences complementary to said primer pair is 
present in said culture or collection of strains; 

performing a second amplification reaction on said second nucleic acid sample 
using the same set of primer pairs used in said first amplification reaction; 
and comparing the amount of each amplification product in said first amplification

reaction to the amount of that amplification product in said second amplification reaction, 
wherein an increased level of an amplification product in said first amplification reaction 
relative to said second amplification reaction indicates that the gene product corresponding 
to said amplification product is the target of said compound if said culture or strain 
overexpresses said gene products and a decreased level of an amplification product in

said first amplification reaction relative to said second amplification reaction indicates that 
the gene product corresponding to said amplification product is the target of said compound 
if said culture or strain overexpresses said gene products, wherein said first and second 
cultures or collection of strains comprise a strain in which a gene product selected from the 
group consisting of a gene product having at least 70% nucleotide sequence identity as

determined using BLASTN version 2.0 with the default parameters to a gene product 
whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product 
encoded by a nucleic acid having at least 70% nucleotide sequence identity as determined 
using BLASTN version 2.0 with the default parameters to a nucleic acid encoding a gene

product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs: 1-6213, a gene product 
having at least 25% amino acid identity as determined using FASTA version 3.0t78 with 
the default parameters to a gene product whose expression is inhibited by an antisense 
nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ

ID NOs.: 1-6213, a gene product encoded by a nucleic acid which hybridizes to a nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 
1-6213 under stringent conditions, a gene product encoded by a nucleic acid which 
hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product whose

activity may be complemented by the gene product whose activity is inhibited by a nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs: 
1-6213 is overexpressed or underexpressed. 

504. A method for identifying the target of a compound which inhibits the proliferation

of an organism comprising: 

obtaining a first nucleic acid sample comprising nucleic acids from a first culture or 
collection of strains wherein said culture or collection of strains comprises a plurality of 

strains wherein each strain overexpresses or underexpresses a different gene product which

is required for proliferation of said organism and wherein said culture or collection of 
strains has been contacted with said compound; 

obtaining a second nucleic acid sample comprising nucleic acids from a second 
culture or collection of strains wherein said culture or collection of strains comprises the 

same strains as said first culture or collection of strains wherein said second culture or

collection of strains has not been contacted with said compound; 

performing a first amplification reaction on said first nucleic acid sample using a 
set of primer pairs which are complementary to nucleotide sequences within or adjacent to 
the genes which encode said gene products, wherein the members of said set of primer pairs 

are designed such that each primer pair would yield an amplification product having a

length distinguishable from the lengths of the amplification products from the other primer 
pairs if a strain comprising the nucleotide sequences complementary to said primer pair is 
present in said culture or collection of strains; 

performing a second amplification reaction on said second nucleic acid sample 

using the same set of primer pairs used in said first amplification reaction;

and comparing the amount of each amplification product in said first amplification 
reaction to the amount of that amplification product in said second amplification reaction, 
wherein an increased level of an amplification product in said first amplification reaction 
relative to said second amplification reaction indicates that the gene product corresponding 

to said amplification product is the target of said compound if said culture or strain
overexpresses said gene products and a decreased level of an amplification product in
said first amplification reaction relative to said second amplification reaction indicates that 
the gene product corresponding to said amplification product is the target of said compound 
if said culture or strain overexpresses said gene products, wherein said first and second 

cultures or collection of strains comprise a strain in which a gene product encoded by a

nucleic acid comprising a nucleotide sequence selected from the group consisting of a 
nucleic acid comprising a nucleic acid having at least 70% nucleotide sequence identity as 
determined using BLASTN version 2.0 with the default parameters to a nucleotide 
sequence selected from the group consisting of SEQ ID NOS.: 6214-42397, a nucleic acid 

comprising a nucleotide sequence which hybridizes to a sequence selected from the group
consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic acid
comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected from 
the group consisting of SEQ ID NOS.: 6214-42397 under moderate conditions is
overexpressed or underexpressed. 

505. A method for identifying the target of a compound which inhibits the proliferation 
of an organism comprising: 

obtaining a first nucleic acid sample comprising nucleic acids from a first culture or

collection of strains wherein said culture or collection of strains comprises a plurality of 
strains wherein each strain overexpresses or underexpresses a different gene product which 
is required for proliferation of said organism and wherein said culture or collection of 
strains has been contacted with said compound; 

obtaining a second nucleic acid sample comprising nucleic acids from a second

culture or collection of strains wherein said culture or collection of strains comprises the 
same strains as said first culture or collection of strains wherein said second culture or 
collection of strains has not been contacted with said compound; 

performing a first amplification reaction on said first nucleic acid sample using a 

set of primer pairs which are complementary to nucleotide sequences within or adjacent to

the genes which encode said gene products, wherein the members of said set of primer pairs 
are designed such that each primer pair would yield an amplification product having a 
length distinguishable from the lengths of the amplification products from the other primer 
pairs if a strain comprising the nucleotide sequences complementary to said primer pair is 

present in said culture or collection of strains;

performing a second amplification reaction on said second nucleic acid sample 
using the same set of primer pairs used in said first amplification reaction; 

and comparing the amount of each amplification product in said first amplification 
reaction to the amount of that amplification product in said second amplification reaction, 

wherein an increased level of an amplification product in said first amplification reaction

relative to said second amplification reaction indicates that the gene product corresponding 
to said amplification product is the target of said compound if said culture or strain 
overexpresses said gene products and a decreased level of an amplification product in
said first amplification reaction relative to said second amplification reaction indicates that 

the gene product corresponding to said amplification product is the target of said compound

if said culture or strain overexpresses said gene products, wherein said first and second 
culture or collection of strains comprise a strain in which a gene product comprising a 
polypeptide selected from the group consisting of a polypeptide having at least 25% amino 
acid identity as determined using FASTA version 3.0t78 to a polypeptide selected from the 

group consisting of SEQ ID NOs.: 42938-78581 and a polypeptide whose activity may be

complemented by a polypeptide selected from the group consisting of SEQ ID NOs: 42938- 
78581 is overexpressed or underexpressed. 

506. The method of Paragraph 500, 501, 502, 503, 504 or 505, wherein one member of
each primer pair for each of said genes is labeled with a detectable dye. 

507. The method of Paragraph 500, 501, 502, 503, 504 or 505, wherein the native 
promoters of said genes which encode said gene products have been replaced with a regulatable 

promoter and one of the primers in said primer pairs is complementary to a nucleotide sequence
within said regulatable promoter. 

508. The method of Paragraph 500, 501, 502, 503, 504 or 505, wherein the native 
promoters for each of said genes were replaced with the same regulatable promoter. 

509. The method of Paragraph 500, 501, 502, 503, 504 or 505, wherein more than one 
regulatable promoter was used to replace the promoters of said genes such that some of said genes

are under the control of a different regulatable promoter. 

510. A method for determining the extent to which each of a plurality of strains are 
present in a culture or collection of strains comprising: 

obtaining a nucleic acid sample comprising nucleic acids from a culture or 
collection of strains wherein said culture or collection of strains comprises a plurality of
strains which transcribe an antisense nucleic acid complementary to a different gene

product which is required for proliferation of said organism; 

performing an amplification reaction using a set of primer pairs which are 

complementary to nucleotide sequences within or adjacent to the nucleic acids which 
encode said antisense nucleic acids, wherein the members of said set of primer pairs are
designed such that each primer pair would yield an amplification product having a length

distinguishable from the lengths of the amplification products from the other primer pairs if 

a strain comprising the nucleotide sequences complementary to said primer pair is present 

in said culture or collection of strains; and 
determining the lengths of the amplification products obtained in said amplification

reaction, wherein said culture comprises a strain in which a gene product whose activity or 

level is inhibited by a nucleic acid comprising a nucleotide sequence selected from the 

group consisting of SEQ ID NOs.: 1-6213 is overexpressed or underexpressed.

511. A method for determining the extent to which each of a plurality of strains are 
present in a culture or collection of strains comprising:

obtaining a nucleic acid sample comprising nucleic acids from a culture or 
collection of strains wherein said culture or collection of strains comprises a plurality of 
strains which transcribe an antisense nucleic acid complementary to a different gene 
product which is required for proliferation of said organism; 
performing an amplification reaction using a set of primer pairs which are

complementary to nucleotide sequences within or adjacent to the nucleic acids which 
encode said antisense nucleic acids, wherein the members of said set of primer pairs are 
designed such that each primer pair would yield an amplification product having a length 
distinguishable from the lengths of the amplification products from the other primer pairs if
a strain comprising the nucleotide sequences complementary to said primer pair is present 
in said culture or collection of strains; and 

determining the lengths of the amplification products obtained in said amplification 
reaction, wherein said culture comprises a strain in which a gene product encoded by a

nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ 
ID NOs.: 6214-42397 is overexpressed or underexpressed. 

512. A method for determining the extent to which each of a plurality of strains are 
present in a culture or collection of strains comprising: 

obtaining a nucleic acid sample comprising nucleic acids from a culture or

collection of strains wherein said culture or collection of strains comprises a plurality of 
strains which transcribe an antisense nucleic acid complementary to a different gene 
product which is required for proliferation of said organism; 

performing an amplification reaction using a set of primer pairs which are 

complementary to nucleotide sequences within or adjacent to the nucleic acids which

encode said antisense nucleic acids, wherein the members of said set of primer pairs are 
designed such that each primer pair would yield an amplification product having a length 
distinguishable from the lengths of the amplification products from the other primer pairs if 
a strain comprising the nucleotide sequences complementary to said primer pair is present 

in said culture or collection of strains; and

determining the lengths of the amplification products obtained in said amplification 
reaction, wherein said culture comprises a strain in which a gene product comprising an 
amino acid sequence selected from the group consisting of SEQ ID NOs.: 42938-78581 is 
overexpressed or underexpressed. 

513. A method for determining the extent to which each of a plurality of strains are

present in a culture or collection of strains comprising: 

obtaining a nucleic acid sample comprising nucleic acids from a culture or 
collection of strains wherein said culture or collection of strains comprises a plurality of 
strains which transcribe an antisense nucleic acid complementary to a different gene 

product which is required for proliferation of said organism;

performing an amplification reaction using a set of primer pairs which are 
complementary to nucleotide sequences within or adjacent to the nucleic acids which 
encode said antisense nucleic acids, wherein the members of said set of primer pairs are 
designed such that each primer pair would yield an amplification product having a length 

distinguishable from the lengths of the amplification products from the other primer pairs if

a strain comprising the nucleotide sequences complementary to said primer pair is present 
in said culture or collection of strains; and 

determining the lengths of the amplification products obtained in said amplification
reaction, wherein said culture comprises a strain in which a gene product selected from the 
group consisting of a gene product having at least 70% nucleotide sequence identity as 
determined using BLASTN version 2.0 with the default parameters to a gene product 

whose expression is inhibited by an antisense nucleic acid comprising a nucleotide

sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product 
encoded by a nucleic acid having at least 70% nucleotide sequence identity as determined 
using BLASTN version 2.0 with the default parameters to a nucleic acid encoding a gene 
product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 

sequence selected from the group consisting of SEQ ID NOs: 1-6213, a gene product

having at least 25% amino acid identity as determined using FASTA version 3.0t78 with 
the default parameters to a gene product whose expression is inhibited by an antisense 
nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ 
ID NOs.: 1-6213, a gene product encoded by a nucleic acid which hybridizes to a nucleic 

15 acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 

1-6213 under stringent conditions, a gene product encoded by a nucleic acid which 
hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product whose 
activity may be complemented by the gene product whose activity is inhibited by a nucleic 

20 acid comprising a nucleotide sequence selected from the group consisting of SEQ ED NOs: 

1-6213 is overexpressed or underexpressed. 
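
The length-based readout recited above can be pictured with a minimal sketch. It assumes each strain has been assigned one expected amplification product length and that peak area is an acceptable proxy for how much of each product is present; the strain names, lengths, tolerance and function names are hypothetical and are not taken from the specification.

```python
from typing import Dict, List, Tuple

# Hypothetical design table: strain name -> expected amplicon length (bp).
EXPECTED_LENGTHS: Dict[str, int] = {
    "strain_A": 150,
    "strain_B": 200,
    "strain_C": 260,
}


def check_design(expected: Dict[str, int], min_gap: int = 10) -> None:
    """Raise ValueError if two expected lengths are closer than min_gap bp."""
    ordered = sorted(expected.values())
    for shorter, longer in zip(ordered, ordered[1:]):
        if longer - shorter < min_gap:
            raise ValueError(
                f"lengths {shorter} and {longer} are not distinguishable"
            )


def relative_abundance(
    peaks: List[Tuple[int, float]],
    expected: Dict[str, int],
    tolerance: int = 2,
) -> Dict[str, float]:
    """Assign each (length, area) peak to the strain whose expected length
    it matches within `tolerance` bp, then normalise the summed areas."""
    signal = {strain: 0.0 for strain in expected}
    for size, area in peaks:
        for strain, length in expected.items():
            if abs(size - length) <= tolerance:
                signal[strain] += area
    total = sum(signal.values())
    if total == 0:
        return signal
    return {strain: value / total for strain, value in signal.items()}
```

A strain whose product is missing from the observed peaks receives a fraction of zero, which corresponds to that strain being absent or depleted in the culture.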

514. A method for determining the extent to which each of a plurality of strains are 
present in a culture or collection of strains comprising: 

obtaining a nucleic acid sample comprising nucleic acids from a culture or
collection of strains wherein said culture or collection of strains comprises a plurality of
strains which transcribe an antisense nucleic acid complementary to a different gene
product which is required for proliferation of said organism;

performing an amplification reaction using a set of primer pairs which are
complementary to nucleotide sequences within or adjacent to the nucleic acids which
encode said antisense nucleic acids, wherein the members of said set of primer pairs are
designed such that each primer pair would yield an amplification product having a length
distinguishable from the lengths of the amplification products from the other primer pairs if
a strain comprising the nucleotide sequences complementary to said primer pair is present
in said culture or collection of strains; and

determining the lengths of the amplification products obtained in said amplification
reaction, wherein said culture comprises a strain in which a gene product encoded by a
nucleic acid comprising a nucleotide sequence selected from the group consisting of a
nucleic acid comprising a nucleic acid having at least 70% nucleotide sequence identity as
determined using BLASTN version 2.0 with the default parameters to a nucleotide
sequence selected from the group consisting of SEQ ID NOS.: 6214-42397, a nucleic acid
comprising a nucleotide sequence which hybridizes to a sequence selected from the group
consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic acid
comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected from
the group consisting of SEQ ID NOS.: 6214-42397 under moderate conditions is
overexpressed or underexpressed.

515. A method for determining the extent to which each of a plurality of strains are 
present in a culture or collection of strains comprising: 

obtaining a nucleic acid sample comprising nucleic acids from a culture or
collection of strains wherein said culture or collection of strains comprises a plurality of
strains which transcribe an antisense nucleic acid complementary to a different gene
product which is required for proliferation of said organism;

performing an amplification reaction using a set of primer pairs which are
complementary to nucleotide sequences within or adjacent to the nucleic acids which
encode said antisense nucleic acids, wherein the members of said set of primer pairs are
designed such that each primer pair would yield an amplification product having a length
distinguishable from the lengths of the amplification products from the other primer pairs if
a strain comprising the nucleotide sequences complementary to said primer pair is present
in said culture or collection of strains; and

determining the lengths of the amplification products obtained in said amplification
reaction, wherein said culture comprises a strain in which a gene product comprising a
polypeptide selected from the group consisting of a polypeptide having at least 25% amino
acid identity as determined using FASTA version 3.0t78 to a polypeptide selected from the
group consisting of SEQ ID NOs.: 42938-78581 and a polypeptide whose activity may be
complemented by a polypeptide selected from the group consisting of SEQ ID NOs.: 42938-
78581 is overexpressed or underexpressed.

516. The method of Paragraph 510, 511, 512, 513, 514 or 515, wherein one member of 
each primer pair for each of said genes is labeled with a detectable dye. 

517. The method of Paragraph 510, 511, 512, 513, 514 or 515, wherein:

said nucleic acid sample is divided into N aliquots; and

said amplification reaction is performed on each aliquot using primer pairs
complementary to nucleotide sequences within or adjacent to 1/N of the genes which
encode said gene products, wherein one of the members of each primer pair in each aliquot
is labeled with a dye and wherein the dyes on the primers in each aliquot are
distinguishable from one another.

518. The method of Paragraph 517, further comprising pooling the amplification
products from each of the aliquots prior to determining the lengths of the amplification products.

519. A method for determining the extent to which each of a plurality of strains are
present in a culture or collection of strains comprising: 

obtaining a nucleic acid sample comprising nucleic acids from a culture or
collection of strains wherein said culture or collection of strains comprises a plurality of
strains which overexpress or underexpress a different gene product which is required for
proliferation of said organism;

performing an amplification reaction using primer pairs which are complementary
to nucleotide sequences within or adjacent to the genes which encode said gene products,
wherein said primer pairs are designed such that each primer pair would yield an
amplification product which is distinguishable from the amplification products produced by
the other primer pairs on a basis selected from the group consisting of length, detectable
label and both length and detectable label if a strain comprising the nucleotide sequences
complementary to said primer pair is present in said culture or collection of strains; and

identifying the amplification products obtained in said amplification reaction,
wherein said culture comprises a strain in which a gene product whose activity or level is
inhibited by a nucleic acid comprising a nucleotide sequence selected from the group
consisting of SEQ ID NOs.: 1-6213 is overexpressed or underexpressed.

520. A method for determining the extent to which each of a plurality of strains are 
present in a culture or collection of strains comprising: 

obtaining a nucleic acid sample comprising nucleic acids from a culture or
collection of strains wherein said culture or collection of strains comprises a plurality of
strains which overexpress or underexpress a different gene product which is required for
proliferation of said organism;

performing an amplification reaction using primer pairs which are complementary
to nucleotide sequences within or adjacent to the genes which encode said gene products,
wherein said primer pairs are designed such that each primer pair would yield an
amplification product which is distinguishable from the amplification products produced by
the other primer pairs on a basis selected from the group consisting of length, detectable
label and both length and detectable label if a strain comprising the nucleotide sequences
complementary to said primer pair is present in said culture or collection of strains; and

identifying the amplification products obtained in said amplification reaction, 
wherein said culture comprises a strain in which a gene product encoded by a nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 
6214-42397 is overexpressed or underexpressed.

521. A method for determining the extent to which each of a plurality of strains are
present in a culture or collection of strains comprising:

obtaining a nucleic acid sample comprising nucleic acids from a culture or
collection of strains wherein said culture or collection of strains comprises a plurality of
strains which overexpress or underexpress a different gene product which is required for
proliferation of said organism;

performing an amplification reaction using primer pairs which are complementary
to nucleotide sequences within or adjacent to the genes which encode said gene products,
wherein said primer pairs are designed such that each primer pair would yield an
amplification product which is distinguishable from the amplification products produced by
the other primer pairs on a basis selected from the group consisting of length, detectable
label and both length and detectable label if a strain comprising the nucleotide sequences
complementary to said primer pair is present in said culture or collection of strains; and

identifying the amplification products obtained in said amplification reaction,
wherein said culture comprises a strain in which a gene product comprising an amino acid
sequence selected from the group consisting of SEQ ID NOs.: 42938-78581 is
overexpressed or underexpressed.

522. A method for determining the extent to which each of a plurality of strains are
present in a culture or collection of strains comprising:

obtaining a nucleic acid sample comprising nucleic acids from a culture or
collection of strains wherein said culture or collection of strains comprises a plurality of
strains which overexpress or underexpress a different gene product which is required for
proliferation of said organism;

performing an amplification reaction using primer pairs which are complementary
to nucleotide sequences within or adjacent to the genes which encode said gene products,
wherein said primer pairs are designed such that each primer pair would yield an
amplification product which is distinguishable from the amplification products produced by
the other primer pairs on a basis selected from the group consisting of length, detectable
label and both length and detectable label if a strain comprising the nucleotide sequences
complementary to said primer pair is present in said culture or collection of strains; and

identifying the amplification products obtained in said amplification reaction,
wherein said culture comprises a strain in which a gene product selected from the group
consisting of a gene product having at least 70% nucleotide sequence identity as determined
using BLASTN version 2.0 with the default parameters to a gene product whose expression
is inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from the
group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid having
at least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with
the default parameters to a nucleic acid encoding a gene product whose expression is
inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from the
group consisting of SEQ ID NOs.: 1-6213, a gene product having at least 25% amino acid
identity as determined using FASTA version 3.0t78 with the default parameters to a gene
product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product
encoded by a nucleic acid which hybridizes to a nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 1-6213 under stringent
conditions, a gene product encoded by a nucleic acid which hybridizes to a nucleic acid
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.:
1-6213 under moderate conditions, and a gene product whose activity may be complemented
by the gene product whose activity is inhibited by a nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 1-6213 is overexpressed or
underexpressed.

523. A method for determining the extent to which each of a plurality of strains are
present in a culture or collection of strains comprising:

obtaining a nucleic acid sample comprising nucleic acids from a culture or
collection of strains wherein said culture or collection of strains comprises a plurality of
strains which overexpress or underexpress a different gene product which is required for
proliferation of said organism;

performing an amplification reaction using primer pairs which are complementary
to nucleotide sequences within or adjacent to the genes which encode said gene products,
wherein said primer pairs are designed such that each primer pair would yield an
amplification product which is distinguishable from the amplification products produced by
the other primer pairs on a basis selected from the group consisting of length, detectable
label and both length and detectable label if a strain comprising the nucleotide sequences
complementary to said primer pair is present in said culture or collection of strains; and

identifying the amplification products obtained in said amplification reaction,
wherein said culture comprises a strain in which a gene product encoded by a nucleic acid
comprising a nucleotide sequence selected from the group consisting of a nucleic acid
comprising a nucleic acid having at least 70% nucleotide sequence identity as determined
using BLASTN version 2.0 with the default parameters to a nucleotide sequence selected
from the group consisting of SEQ ID NOS.: 6214-42397, a nucleic acid comprising a
nucleotide sequence which hybridizes to a sequence selected from the group consisting of
SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic acid comprising a
nucleotide sequence which hybridizes to a nucleotide sequence selected from the group
consisting of SEQ ID NOS.: 6214-42397 under moderate conditions is overexpressed or
underexpressed.

524. A method for determining the extent to which each of a plurality of strains are
present in a culture or collection of strains comprising:

obtaining a nucleic acid sample comprising nucleic acids from a culture or
collection of strains wherein said culture or collection of strains comprises a plurality of
strains which overexpress or underexpress a different gene product which is required for
proliferation of said organism;

performing an amplification reaction using primer pairs which are complementary
to nucleotide sequences within or adjacent to the genes which encode said gene products,
wherein said primer pairs are designed such that each primer pair would yield an
amplification product which is distinguishable from the amplification products produced by
the other primer pairs on a basis selected from the group consisting of length, detectable
label and both length and detectable label if a strain comprising the nucleotide sequences
complementary to said primer pair is present in said culture or collection of strains; and

identifying the amplification products obtained in said amplification reaction,
wherein said culture comprises a strain in which a gene product comprising a polypeptide
selected from the group consisting of a polypeptide having at least 25% amino acid identity
as determined using FASTA version 3.0t78 to a polypeptide selected from the group
consisting of SEQ ID NOs.: 42938-78581 and a polypeptide whose activity may be
complemented by a polypeptide selected from the group consisting of SEQ ID NOs.: 42938-
78581 is overexpressed or underexpressed.

525. The method of Paragraph 519, 520, 521, 522, 523 or 524, wherein said primer pairs
are divided into at least two sets, each primer pair comprises a primer which is labeled with a
distinguishable dye, and the distinguishable dye used to label each set of primer pairs is
distinguishable from the dye used to label the other sets of primer pairs.

526. The method of Paragraph 519, 520, 521, 522, 523 or 524, wherein:

said nucleic acid sample is divided into N aliquots; and

said amplification reaction is performed on each aliquot using primer pairs
complementary to nucleotide sequences within or adjacent to 1/N of the genes which
encode said gene products, wherein one of the members of each primer pair in each aliquot
is labeled with a dye and wherein the dyes on the primers in each aliquot are
distinguishable from one another.

527. The method of Paragraph 526, further comprising pooling the amplification 
products from each of the aliquots prior to determining the lengths of the amplification products. 
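
The aliquot-and-dye scheme of Paragraphs 525 to 527 can be pictured with a short sketch. It assumes one dye per aliquot, a round-robin split of the genes across the N aliquots, and identification of each pooled product by its (dye, length) pair; the gene names, dye labels and function names are illustrative only and do not come from the specification.

```python
from typing import Dict, List, Tuple


def assign_aliquots(genes: List[str], dyes: List[str]) -> Dict[str, List[str]]:
    """Split genes round-robin into len(dyes) aliquots, one dye per aliquot,
    so each aliquot amplifies roughly 1/N of the genes."""
    plan: Dict[str, List[str]] = {dye: [] for dye in dyes}
    for index, gene in enumerate(genes):
        plan[dyes[index % len(dyes)]].append(gene)
    return plan


def build_lookup(
    plan: Dict[str, List[str]], lengths: Dict[str, int]
) -> Dict[Tuple[str, int], str]:
    """Map each (dye, amplicon length) pair to a gene. Lengths may repeat
    across aliquots but must be unique within one dye."""
    lookup: Dict[Tuple[str, int], str] = {}
    for dye, genes in plan.items():
        for gene in genes:
            key = (dye, lengths[gene])
            if key in lookup:
                raise ValueError(f"{gene} collides with {lookup[key]} at {key}")
            lookup[key] = gene
    return lookup


def identify_pooled(
    products: List[Tuple[str, int]], lookup: Dict[Tuple[str, int], str]
) -> List[str]:
    """Resolve pooled (dye, length) observations back to gene names,
    skipping observations that match no designed product."""
    return [lookup[p] for p in products if p in lookup]
```

Because the dye resolves products of equal length, the same set of lengths can be reused in every aliquot, which is what allows the products to be pooled before sizing.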

528. The method of Paragraph 519, 520, 521, 522, 523 or 524, wherein the native
promoters of said genes which encode said gene products have been replaced with a regulatable
promoter and one of the primers in said primer pairs is complementary to a nucleotide sequence
within said regulatable promoter.

529. The method of Paragraph 528, wherein the native promoters for each of said genes
were replaced with the same regulatable promoter.

530. The method of Paragraph 528, wherein more than one regulatable promoter was
used to replace the promoters of said genes such that some of said genes are under the control of a
different regulatable promoter.

Definitions

By "biological pathway" is meant any discrete cell function or process that is carried out by
a gene product or a subset of gene products. Biological pathways include anabolic, catabolic,
enzymatic, biochemical and metabolic pathways as well as pathways involved in the production of
cellular structures such as cell walls. Biological pathways that are usually required for proliferation
of cells or microorganisms include, but are not limited to, cell division, DNA synthesis and
replication, RNA synthesis (transcription), protein synthesis (translation), protein processing,
protein transport, fatty acid biosynthesis, electron transport chains, cell wall synthesis, cell
membrane production, synthesis and maintenance, and the like.

By "inhibit activity of a gene or gene product" is meant having the ability to interfere with
the function of a gene or gene product in such a way as to decrease expression of the gene, in such a
way as to reduce the level or activity of a product of the gene or in such a way as to inhibit the
interaction of the gene or gene product with other biological molecules required for its activity.
Agents which inhibit the activity of a gene include agents that inhibit transcription of the gene,
agents that inhibit processing of the transcript of the gene, agents that reduce the stability of the
transcript of the gene, and agents that inhibit translation of the mRNA transcribed from the gene.
In microorganisms, agents which inhibit the activity of a gene can act to decrease expression of the
operon in which the gene resides or alter the folding or processing of operon RNA so as to reduce
the level or activity of the gene product. The gene product can be a non-translated RNA such as
ribosomal RNA, a translated RNA (mRNA) or the protein product resulting from translation of the
gene mRNA. Of particular utility to the present invention are antisense RNAs that have activities
against the operons or genes to which they specifically hybridize.

By "activity against a gene product" is meant having the ability to inhibit the function or to
reduce the level or activity of the gene product in a cell. This includes, but is not limited to,
inhibiting the enzymatic activity of the gene product or the ability of the gene product to interact
with other biological molecules required for its activity, including inhibiting the gene product's
assembly into a multimeric structure.

By "activity against a protein" is meant having the ability to inhibit the function or to
reduce the level or activity of the protein in a cell. This includes, but is not limited to, inhibiting the
enzymatic activity of the protein or the ability of the protein to interact with other biological
molecules required for its activity, including inhibiting the protein's assembly into a multimeric
structure.

By "activity against a nucleic acid" is meant having the ability to inhibit the function or to
reduce the level or activity of the nucleic acid in a cell. This includes, but is not limited to,
inhibiting the ability of the nucleic acid to interact with other biological molecules required for its
activity, including inhibiting the nucleic acid's assembly into a multimeric structure.






WO 02/077183 PCT/US02/09107 

By "activity against a gene" is meant having the ability to inhibit the function or expression
of the gene in a cell. This includes, but is not limited to, inhibiting the ability of the gene to interact
with other biological molecules required for its activity.

By "activity against an operon" is meant having the ability to inhibit the function or reduce
the level of one or more products of the operon in a cell. This includes, but is not limited to,
inhibiting the enzymatic activity of one or more products of the operon or the ability of one or more
products of the operon to interact with other biological molecules required for its activity.

By "antibiotic" is meant an agent which inhibits the proliferation of a cell or
microorganism.

By "E. coli or Escherichia coli" is meant Escherichia coli or any organism previously
categorized as a species of Shigella, including Shigella boydii, Shigella flexneri, Shigella
dysenteriae, Shigella sonnei, and Shigella 2A.

By "homologous coding nucleic acid" is meant a nucleic acid homologous to a nucleic acid
encoding a gene product whose activity or level is inhibited by a nucleic acid selected from the
group consisting of SEQ ID NOs.: 1-6213 or a portion thereof. In some embodiments, the
homologous coding nucleic acid may have at least 97%, at least 95%, at least 90%, at least 85%, at
least 80%, or at least 70% nucleotide sequence identity to a nucleotide sequence selected from the
group consisting of SEQ ID NOS.: 6214-42,397 and fragments comprising at least 10, 15, 20, 25,
30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides thereof. In other
embodiments the homologous coding nucleic acids may have at least 97%, at least 95%, at least
90%, at least 85%, at least 80%, or at least 70% nucleotide sequence identity to a nucleotide
sequence selected from the group consisting of the nucleotide sequences complementary to one of
SEQ ID NOs.: 1-6213 and fragments comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150,
200, 300, 400, or 500 consecutive nucleotides thereof. Identity may be measured using BLASTN
version 2.0 with the default parameters or tBLASTX with the default parameters (Altschul, S.F. et
al., Gapped BLAST and PSI-BLAST: A New Generation of Protein Database Search Programs,
Nucleic Acids Res. 25: 3389-3402 (1997)). Alternatively, a "homologous coding nucleic acid"
could be identified by membership of the gene of interest in a functional orthologue cluster. All
other members of that orthologue cluster would be considered homologues. Such a library of
functional orthologue clusters can be found at http://www.ncbi.nlm.nih.gov/COG. A gene can be
classified into a cluster of orthologous groups or COG by using the COGNITOR program available
at the above web site, or by direct BLASTP comparison of the gene of interest to the members of
the COGs and analysis of these results as described by Tatusov, R.L., Galperin, M.Y., Natale, D.A.
and Koonin, E.V. (2000) The COG database: a tool for genome-scale analysis of protein functions
and evolution. Nucleic Acids Research v. 28 n. 1, pp. 33-36.
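
As a rough illustration of what an identity threshold such as "at least 70%" means, the following sketch computes percent identity over two sequences that have already been aligned to equal length with "-" as the gap character. This is a simplification introduced here for illustration: BLASTN reports identity over the local alignments it finds, and nothing below reproduces its scoring or default parameters. The function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both
    sequences. Columns that are gaps in both sequences are ignored."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a, aligned_b)
        if not (a == "-" and b == "-")
    ]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 70.0) -> bool:
    """True if the aligned pair reaches the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Under these assumptions, a gapped column counts as a mismatch, so insertions and deletions lower the reported identity.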

Homologous coding nucleic acids and the homologous polypeptides which they encode
may also be identified using a "reciprocal" best-hit analysis. To facilitate the identification of
homologous coding nucleic acids and homologous polypeptides, paralogous genes within each of
51 organisms are identified and clustered prior to comparison to other organisms. Briefly, the
polypeptide sequence of each polypeptide encoded by each open reading frame (ORF) in a given
organism is compared to the polypeptide sequence encoded by every other ORF for that organism
for each of the 51 pathogenic organisms (PathoSeq Sept 2001 release) using the BLASTP 2.09
algorithm without filtering. Simultaneously, the polypeptide sequence encoded by each ORF of an
organism is compared to the polypeptide sequences encoded by each of the ORFs in the remaining
50 organisms. Those polypeptides within a single organism that share a higher degree of sequence
identity to one another than to polypeptide sequences obtained from any other organisms are
clustered as "paralog" sequences for "reciprocal" best-hit analysis.

For each reference organism, the 50 homologous coding nucleic acids (and the 50
homologous polypeptides which they encode) can be determined by identifying the ORFs in each of
the 50 comparison organisms which encode a polypeptide sharing the highest degree of amino acid
sequence identity to the polypeptide encoded by the ORF from the reference organism. The
accuracy of the identification of the predicted homologous coding nucleic acids (and the
homologous polypeptides which they encode) is confirmed by a "reciprocal" BLAST analysis in
which the polypeptide sequence of the predicted homologous polypeptide is compared against the
polypeptides encoded by each of the ORFs in the reference organism using the BLASTP 2.09
algorithm without filtering. Only those polypeptides that share the highest degree of amino acid
sequence identity in each portion of the two-way comparison are retained for further analysis.
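
The two-way comparison described above can be sketched as follows, assuming the pairwise identities have already been computed (for example from BLASTP output) and stored as dictionaries keyed by (query ORF, subject ORF). The data layout, ORF names and function names are assumptions made for this sketch; paralog clustering is not modelled.

```python
from typing import Dict, Optional, Tuple

# (query ORF, subject ORF) -> percent amino acid identity
Scores = Dict[Tuple[str, str], float]


def best_hit(query: str, scores: Scores) -> Optional[str]:
    """Return the subject with the highest identity to `query`, if any."""
    hits = {
        subject: identity
        for (q, subject), identity in scores.items()
        if q == query
    }
    if not hits:
        return None
    return max(hits, key=lambda subject: hits[subject])


def reciprocal_best_hits(forward: Scores, reverse: Scores) -> Dict[str, str]:
    """Keep a reference ORF and a comparison ORF only when each is the
    other's best hit, i.e. the pair wins in both directions."""
    pairs: Dict[str, str] = {}
    for query in sorted({q for q, _ in forward}):
        candidate = best_hit(query, forward)
        if candidate is not None and best_hit(candidate, reverse) == query:
            pairs[query] = candidate
    return pairs
```

Here `forward` holds reference-versus-comparison scores and `reverse` holds the comparison-versus-reference scores; a reference ORF whose best hit prefers a different reference ORF in the reverse search is dropped.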

The term "homologous coding nucleic acid" also includes nucleic acids comprising
nucleotide sequences which encode polypeptides having at least 99%, at least 95%, at least 90%, at least
85%, at least 80%, at least 70%, at least 60%, at least 50%, at least 40% or at least 25% amino acid
identity or similarity to a polypeptide comprising the amino acid sequence of one of SEQ ID NOs.:
42,398-78,581 or to a polypeptide whose expression is inhibited by a nucleic acid comprising a
nucleotide sequence of one of SEQ ID NOs.: 1-6213 or fragments comprising at least 5, 10, 15, 20,
25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof as determined using the FASTA
version 3.0t78 algorithm with the default parameters. Alternatively, protein identity or similarity
may be identified using BLASTP with the default parameters, BLASTX with the default
parameters, TBLASTN with the default parameters, or tBLASTX with the default parameters
(Altschul, S.F. et al., Gapped BLAST and PSI-BLAST: A New Generation of Protein Database
Search Programs, Nucleic Acids Res. 25: 3389-3402 (1997)).

Additionally, homologous coding nucleic acids and the homologous polypeptides which
they encode may be identified using a "reciprocal" best-hit analysis. To facilitate the identification
of homologous coding nucleic acids and homologous polypeptides, paralogous genes within each of
51 organisms are identified and clustered prior to comparison to other organisms. Briefly, the
polypeptide sequence of each polypeptide encoded by each open reading frame (ORF) in a given
organism is compared to the polypeptide sequence encoded by every other ORF for that organism
for each of the 51 pathogenic organisms (PathoSeq Sept 2001 release) using the BLASTP 2.09
algorithm without filtering. Simultaneously, the polypeptide sequence encoded by each ORF of an
organism is compared to the polypeptide sequences encoded by each of the ORFs in the remaining
50 organisms. Those polypeptides within a single organism that share a higher degree of sequence
identity to one another than to polypeptide sequences obtained from any other organisms are
clustered as "paralog" sequences for "reciprocal" best-hit analysis.

For each reference organism, the 50 homologous coding nucleic acids (and the 50
homologous polypeptides which they encode) can be determined by identifying the ORFs in each of
the 50 comparison organisms which encode a polypeptide sharing the highest degree of amino acid
sequence identity to the polypeptide encoded by the ORF from the reference organism. The
accuracy of the identification of the predicted homologous coding nucleic acids (and the
homologous polypeptides which they encode) is confirmed by a "reciprocal" BLAST analysis in
which the polypeptide sequence of the predicted homologous polypeptide is compared against the
polypeptides encoded by each of the ORFs in the reference organism using the BLASTP 2.09
algorithm without filtering. Only those polypeptides that share the highest degree of amino acid
sequence identity in each portion of the two-way comparison are retained for further analysis.

The term "homologous coding nucleic acid" also includes coding nucleic acids which
hybridize under stringent conditions to a nucleic acid selected from the group consisting of the
nucleotide sequences complementary to one of SEQ ID NOS.: 6214-42,397 and coding nucleic
acids comprising nucleotide sequences which hybridize under stringent conditions to a fragment
comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive
nucleotides of the sequences complementary to one of SEQ ID NOS.: 6214-42,397. As used
herein, "stringent conditions" means hybridization to filter-bound nucleic acid in 6xSSC at about
45°C followed by one or more washes in 0.1xSSC/0.2% SDS at about 68°C. Other exemplary
stringent conditions may refer, e.g., to washing in 6xSSC/0.05% sodium pyrophosphate at 37°C,
48°C, 55°C, and 60°C as appropriate for the particular probe being used.

The term "homologous coding nucleic acid" also includes coding nucleic acids comprising
nucleotide sequences which hybridize under moderate conditions to a nucleotide sequence selected
from the group consisting of the sequences complementary to one of SEQ ID NOS.: 6214-42,397
and coding nucleic acids comprising nucleotide sequences which hybridize under moderate
conditions to a fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300,
400, or 500 consecutive nucleotides of the sequences complementary to one of SEQ ID NOS.:
6214-42,397. As used herein, "moderate conditions" means hybridization to filter-bound DNA in
6x sodium chloride/sodium citrate (SSC) at about 45°C followed by one or more washes in
0.2xSSC/0.1% SDS at about 42-65°C.

The term "homologous coding nucleic acids" also includes nucleic acids comprising
nucleotide sequences which encode a gene product whose activity may be complemented by a gene
encoding a gene product whose activity is inhibited by a nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 1-6213. In some embodiments, the
homologous coding nucleic acids may encode a gene product whose activity is complemented by
the gene product encoded by a nucleic acid comprising a nucleotide sequence selected from the
group consisting of SEQ ID NOS.: 6214-42,397. In other embodiments, the homologous coding
nucleic acids may comprise a nucleotide sequence encoding a gene product whose activity is
complemented by one of the polypeptides of SEQ ID NOs.: 42,398-78,581.

The term "homologous antisense nucleic acid" includes nucleic acids comprising a
nucleotide sequence having at least 97%, at least 95%, at least 90%, at least 85%, at least 80%, or at
least 70% nucleotide sequence identity to a nucleotide sequence selected from the group consisting
of one of the sequences of SEQ ID NOS. 1-6213 and fragments comprising at least 10, 15, 20, 25,
30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides thereof. Homologous
antisense nucleic acids may also comprise nucleotide sequences which have at least 97%, at least
95%, at least 90%, at least 85%, at least 80%, or at least 70% nucleotide sequence identity to a
nucleotide sequence selected from the group consisting of the sequences complementary to one of
the sequences of SEQ ID NOS.: 6214-42,397 and fragments comprising at least 10, 15, 20, 25, 30, 35,
40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides thereof. Nucleic acid identity
may be determined as described above.
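
The identity thresholds above can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned by some external tool and that gaps are written as "-"; the function names and the choice of denominator (all aligned columns that are not gap-versus-gap) are illustrative assumptions, not the identity calculation performed by the alignment programs referenced in this specification.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumption: gaps are written as '-' and columns that are gaps in both
    sequences are ignored. Alignment programs may use a different denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # column carries no information
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str, threshold: float = 70.0) -> bool:
    """True if the aligned pair reaches the given percent identity (e.g. 70, 80, 85, 90, 95 or 97)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For example, two aligned 10-nucleotide fragments differing at one position give 90% identity, which satisfies the 70%, 80%, 85% and 90% thresholds but not the 95% or 97% thresholds.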

The term "homologous antisense nucleic acid" also includes antisense nucleic acids
comprising nucleotide sequences which hybridize under stringent conditions to a nucleotide
sequence complementary to one of SEQ ID NOs.: 1-6213 and antisense nucleic acids comprising
nucleotide sequences which hybridize under stringent conditions to a fragment comprising at least
10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides of the
sequence complementary to one of SEQ ID NOs. 1-6213. Homologous antisense nucleic acids also
include antisense nucleic acids comprising nucleotide sequences which hybridize under stringent
conditions to a nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214-42,397
and antisense nucleic acids comprising nucleotide sequences which hybridize under
stringent conditions to a fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200,
300, 400, or 500 consecutive nucleotides of one of SEQ ID NOS.: 6214-42,397.

The term "homologous antisense nucleic acid" also includes antisense nucleic acids
comprising nucleotide sequences which hybridize under moderate conditions to a nucleotide
sequence complementary to one of SEQ ID NOs.: 1-6213 and antisense nucleic acids comprising
nucleotide sequences which hybridize under moderate conditions to a fragment comprising at least
10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides of the
sequence complementary to one of SEQ ID NOs. 1-6213. Homologous antisense nucleic acids also
include antisense nucleic acids comprising nucleotide sequences which hybridize under moderate
conditions to a nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214-42,397
and antisense nucleic acids comprising nucleotide sequences which hybridize under
moderate conditions to a fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150,
200, 300, 400, or 500 consecutive nucleotides of one of SEQ ID NOS.: 6214-42,397.

By "homologous polypeptide" is meant a polypeptide homologous to a polypeptide whose
activity or level is inhibited by a nucleic acid comprising a nucleotide sequence selected from the
group consisting of SEQ ID NOs.: 1-6213 or by a homologous antisense nucleic acid. The term
"homologous polypeptide" includes polypeptides having at least 99%, at least 95%, at least 90%, at least
85%, at least 80%, at least 70%, at least 60%, at least 50%, at least 40% or at least 25% amino acid
identity or similarity to a polypeptide whose activity or level is inhibited by a nucleic acid selected
from the group consisting of SEQ ID NOs: 1-6213 or by a homologous antisense nucleic acid, or
polypeptides having at least 99%, at least 95%, at least 90%, at least 85%, at least 80%, at least 70%, at
least 60%, at least 50%, at least 40% or at least 25% amino acid identity or similarity to a
fragment comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150
consecutive amino acids of a polypeptide whose activity or level is inhibited by a nucleic acid
selected from the group consisting of SEQ ID NOs.: 1-6213 or by a homologous antisense nucleic
acid. Identity or similarity may be determined using the FASTA version 3.0t78 algorithm with the
default parameters. Alternatively, protein identity or similarity may be identified using BLASTP
with the default parameters, BLASTX with the default parameters, or TBLASTN with the default
parameters (Altschul, S.F. et al. Gapped BLAST and PSI-BLAST: A New Generation of Protein
Database Search Programs, Nucleic Acids Res. 25: 3389-3402 (1997)). Additionally, homologous
coding nucleic acids and the homologous polypeptides which they encode may be identified using a
"reciprocal" best-hit analysis. To facilitate the identification of homologous coding nucleic acids
and homologous polypeptides, paralogous genes within each of 51 organisms are identified and
clustered prior to comparison to other organisms. Briefly, the polypeptide sequence of each
polypeptide encoded by each open reading frame (ORF) in a given organism is compared to the
polypeptide sequence encoded by every other ORF for that organism for each of the 51 pathogenic
organisms (PathoSeq Sept 2001 release) using the BLASTP 2.09 algorithm without filtering.
Simultaneously, the polypeptide sequence encoded by each ORF of an organism is compared to the
polypeptide sequences encoded by each of the ORFs in the remaining 50 organisms. Those
polypeptides within a single organism that share a higher degree of sequence identity to one
another than to polypeptide sequences obtained from any other organisms are clustered as "paralog"
sequences for "reciprocal" best-hit analysis.

For each reference organism, the 50 homologous coding nucleic acids (and the 50
homologous polypeptides which they encode) can be determined by identifying the ORFs in each of
the 50 comparison organisms which encode a polypeptide sharing the highest degree of amino acid
sequence identity to the polypeptide encoded by the ORF from the reference organism. The
accuracy of the identification of the predicted homologous coding nucleic acids (and the
homologous polypeptides which they encode) is confirmed by a "reciprocal" BLAST analysis in
which the polypeptide sequence of the predicted homologous polypeptide is compared against the
polypeptides encoded by each of the ORFs in the reference organism using the BLASTP 2.09
algorithm without filtering. Only those polypeptides that share the highest degree of amino acid
sequence identity in each portion of the two-way comparison are retained for further analysis.
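
The two-way comparison described above can be sketched as follows. This is a minimal illustration only: it assumes the pairwise amino acid identities have already been computed (for example from BLASTP output) and stored in dictionaries keyed by (query ORF, subject ORF). The ORF names, the data layout and the function names are hypothetical and are not taken from the PathoSeq analysis.

```python
from typing import Dict, Optional, Tuple

# Hypothetical input format: percent identity keyed by (query ORF, subject ORF).
Scores = Dict[Tuple[str, str], float]


def best_hit(query: str, scores: Scores) -> Optional[str]:
    """Return the subject ORF with the highest identity to `query`, or None if there is no hit."""
    hits = {subject: value for (q, subject), value in scores.items() if q == query}
    return max(hits, key=hits.get) if hits else None


def reciprocal_best_hits(ref_vs_cmp: Scores, cmp_vs_ref: Scores) -> Dict[str, str]:
    """Keep a (reference ORF, comparison ORF) pair only if each is the other's best hit."""
    pairs: Dict[str, str] = {}
    for ref_orf in sorted({q for q, _ in ref_vs_cmp}):
        candidate = best_hit(ref_orf, ref_vs_cmp)
        # The candidate is retained only when the reverse search points back to ref_orf.
        if candidate is not None and best_hit(candidate, cmp_vs_ref) == ref_orf:
            pairs[ref_orf] = candidate
    return pairs
```

In this sketch a reference ORF whose best hit in the comparison organism has a different best hit in the reverse direction is simply dropped, which mirrors the statement that only polypeptides sharing the highest identity in each portion of the two-way comparison are retained. Ties are resolved arbitrarily here, which a real analysis would need to handle explicitly.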

The term "homologous polypeptide" also includes polypeptides having at least 99%, at least 95%, at
least 90%, at least 85%, at least 80%, at least 70%, at least 60%, at least 50%, at least 40% or at
least 25% amino acid identity or similarity to a polypeptide selected from the group consisting of
SEQ ID NOs: 42,398-78,581 and polypeptides having at least 99%, at least 95%, at least 90%, at least
85%, at least 80%, at least 70%, at least 60%, at least 50%, at least 40% or at least 25% amino acid
identity or similarity to a fragment comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or
150 consecutive amino acids of a polypeptide selected from the group consisting of SEQ ID NOs:
42,398-78,581.

The invention also includes polynucleotides, preferably DNA molecules, that hybridize to 
one of the nucleic acids of SEQ ID NOs.: 1-6213, SEQ ID NOs.: 6214-42,397 or the complements 
of any of the preceding nucleic acids. Such hybridization may be under stringent or moderate 
conditions as defined above or under other conditions which permit specific hybridization. The 
nucleic acid molecules of the invention that hybridize to these DNA sequences include
oligodeoxynucleotides ("oligos") which hybridize to the target gene under highly stringent or
stringent conditions. In general, for oligos between 14 and 70 nucleotides in length the melting
temperature (Tm) is calculated using the formula:

Tm (°C) = 81.5 + 16.6(log[monovalent cations (molar)]) + 0.41(% G+C) - (500/N)

where N is the length of the probe. If the hybridization is carried out in a solution
containing formamide, the melting temperature may be calculated using the equation:

Tm (°C) = 81.5 + 16.6(log[monovalent cations (molar)]) + 0.41(% G+C) - 0.61(% formamide) - (500/N)

where N is the length of the probe. In general, hybridization is carried out at about 20-25
degrees below Tm (for DNA-DNA hybrids) or about 10-15 degrees below Tm (for RNA-DNA
hybrids).
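
As a worked illustration of the two formulas above, the following sketch computes Tm and the corresponding hybridization temperature range. It assumes that "log" denotes the base-10 logarithm, as is conventional for this formula; the function and parameter names are illustrative only.

```python
import math


def melting_temperature(length: int, gc_percent: float,
                        monovalent_molar: float,
                        formamide_percent: float = 0.0) -> float:
    """Tm in degrees C for an oligo of `length` nucleotides.

    Assumption: 'log' in the formula is log10. With formamide_percent = 0 this
    reduces to the first formula; otherwise it is the formamide-corrected one.
    """
    if length <= 0 or monovalent_molar <= 0:
        raise ValueError("length and cation concentration must be positive")
    return (81.5
            + 16.6 * math.log10(monovalent_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / length)


def hybridization_range(tm: float, hybrid: str = "DNA-DNA") -> tuple:
    """Return (lowest, highest) hybridization temperature below Tm.

    About 20-25 degrees below Tm for DNA-DNA hybrids, about 10-15 degrees
    below Tm for RNA-DNA hybrids, as stated in the text.
    """
    offsets = {"DNA-DNA": (20.0, 25.0), "RNA-DNA": (10.0, 15.0)}
    near, far = offsets[hybrid]
    return (tm - far, tm - near)
```

For a 20-nucleotide probe with 50% G+C in 1 M monovalent cation, the first formula gives 81.5 + 0 + 20.5 - 25 = 77°C, so a DNA-DNA hybridization would be carried out at about 52-57°C; adding 50% formamide lowers the calculated Tm by 30.5 degrees to 46.5°C.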

Other hybridization conditions are apparent to those of skill in the art (see, for example,
Ausubel, F.M. et al., eds., 1989, Current Protocols in Molecular Biology, Vol. I, Greene Publishing
Associates, Inc. and John Wiley & Sons, Inc., New York, at pp. 6.3.1-6.3.6 and 2.10.3).

The term Salmonella is the generic name for a large group of gram negative enteric
bacteria that are closely related to Escherichia coli. The diseases caused by Salmonella are often
due to contamination of foodstuffs or the water supply and affect millions of people each year.
Traditional methods of Salmonella taxonomy were based on assigning a separate species name to
each serologically distinguishable strain (Kauffmann, F 1966 The bacteriology of the
Enterobacteriaceae. Munksgaard, Copenhagen). Serology of Salmonella is based on surface
antigens (O [somatic] and H [flagellar]). Over 2,400 serotypes or serovars of Salmonella are
known (Popoff, et al. 2000 Res. Microbiol. 151:63-65). Therefore, each serotype was considered to
be a separate species and often given names accordingly (e.g. S. paratyphi, S. typhimurium, S.
typhi, S. enteritidis, etc.).

However, by the 1970s and 1980s it was recognized that this system was not only
cumbersome, but also inaccurate. Then, many Salmonella species were lumped into a single
species (all serotypes and subgenera I, II, and IV and all serotypes of Arizona) with a second
subspecies, S. bongori, also recognized (Crosa, et al., 1973, J. Bacteriol. 115:307-315). Though
species designations are based on the highly variable surface antigens, the Salmonella are very
similar otherwise with a major exception being pathogenicity determinants.

There has been some debate on the correct name for the Salmonella species. Currently
(Brenner, et al. 2000 J. Clin. Microbiol. 38:2465-2467), the accepted name is Salmonella enterica.
S. enterica is divided into six subspecies (I, S. enterica subsp. enterica; II, S. enterica subsp.
salamae; IIIa, S. enterica subsp. arizonae; IIIb, S. enterica subsp. diarizonae; IV, S. enterica subsp.
houtenae; and VI, S. enterica subsp. indica). Within subspecies I, serotype names are used to distinguish
each of the serotypes or serovars (e.g. S. enterica serotype Enteritidis, S. enterica serotype
Typhimurium, S. enterica serotype Typhi, and S. enterica serotype Choleraesuis, etc.). Current
convention is to spell this out on first usage (Salmonella enterica ser. Typhimurium) and then use
an abbreviated form (Salmonella Typhimurium or S. Typhimurium). Note, the genus and species
names (Salmonella enterica) are italicized but not the serotype/serovar name (Typhimurium).
Because the taxonomic committees have yet to officially approve of the actual species name, this
latter system is what is employed by the CDC (Brenner, et al. 2000 J. Clin. Microbiol. 38:2465-2467).
Due to the concerns of both taxonomic priority and medical importance, some of these
serotypes might ultimately receive full species designations (S. typhi would be the most notable).
Therefore, as used herein "Salmonella enterica" or "S. enterica" includes serovars Typhi,
Typhimurium, Paratyphi, Choleraesuis, etc. However, appeals of the "official" name are in process
and the taxonomic designations may change (S. choleraesuis is the species name that could replace
S. enterica based solely on priority).

By "identifying a compound" is meant to screen one or more compounds in a collection of
compounds such as a combinatorial chemical library or other library of chemical compounds or to
characterize a single compound by testing the compound in a given assay and determining whether
it exhibits the desired activity.

By "inducer" is meant an agent or solution which, when placed in contact with a cell or
microorganism, increases transcription, or inhibitor and/or promoter clearance/fidelity, from a
desired promoter.

As used herein, "nucleic acid" means DNA, RNA, or modified nucleic acids. Thus, the
terminology "the nucleic acid of SEQ ID NO: X" or "the nucleic acid comprising the nucleotide
sequence" includes both the DNA sequence of SEQ ID NO: X and an RNA sequence in which the
thymidines in the DNA sequence have been substituted with uridines in the RNA sequence and in
which the deoxyribose backbone of the DNA sequence has been substituted with a ribose backbone
in the RNA sequence. Modified nucleic acids are nucleic acids having nucleotides or structures
which do not occur in nature, such as nucleic acids in which the internucleotide phosphate residues
are replaced with methylphosphonates, phosphorothioates, phosphoramidates, and phosphate esters.
Nonphosphate internucleotide analogs such as siloxane bridges, carbonate bridges, thioester bridges,
as well as many others known in the art may also be used in modified nucleic acids. Modified
nucleic acids may also comprise α-anomeric nucleotide units, and modified nucleotides such as
1,2-dideoxy-d-ribofuranose, 1,2-dideoxy-1-phenylribofuranose, and N4,N4-ethano-5-methyl-cytosine
are contemplated for use in the present invention. Modified nucleic acids may also be
peptide nucleic acids in which the entire deoxyribose-phosphate backbone has been exchanged with a
chemically completely different, but structurally homologous, polyamide (peptide) backbone
containing 2-aminoethyl glycine units.
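
The DNA/RNA equivalence stated in the definition of "nucleic acid" above can be illustrated with a short sketch. It operates on plain sequence strings restricted to the letters A, C, G and T; the function names are illustrative, and the example sequences are arbitrary and are not taken from the Sequence Listing.

```python
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def dna_to_rna(dna: str) -> str:
    """Return the RNA form of a DNA sequence: every thymidine (T) becomes uridine (U)."""
    return dna.upper().replace("T", "U")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return dna.upper().translate(_DNA_COMPLEMENT)[::-1]


def antisense_rna(coding_dna: str) -> str:
    """RNA complementary to the transcript of `coding_dna` (illustrative helper)."""
    return dna_to_rna(reverse_complement(coding_dna))
```

For instance, the DNA string ATGCTT corresponds to the RNA string AUGCUU, and its complementary (antisense) RNA, read 5' to 3', is AAGCAU.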

As used herein, "sub-lethal" means a concentration of an agent below the concentration
required to inhibit all cell growth.

Brief Description of the Drawings 
Figure 1A illustrates a method for replacing a promoter using a promoter replacement
cassette comprising a 5' region homologous to the sequence which is 5' of the natural promoter in
the chromosome, the promoter which is to replace the chromosomal promoter and a 3' region which
is homologous to sequences 3' of the natural promoter in the chromosome.

Figure 1B illustrates a method for replacing a promoter using a promoter replacement
cassette comprising a nucleic acid encoding an identifiable or selectable marker disposed between
the 5' region which is homologous to the sequence 5' of the natural promoter and the promoter
which is to replace the chromosomal promoter and a transcriptional terminator 3' of the gene
encoding an identifiable or selectable marker.

Figures 2A and 2B illustrate one method for identifying amplification products which are
underrepresented or overrepresented in a culture.

Figures 3A and 3B illustrate another method for identifying amplification products which 
are underrepresented or overrepresented in a culture. 

Figure 4 illustrates the results of a hybridization analysis where the antisense nucleic acid
expressed by a strain in the culture is not complementary to all or a portion of the gene encoding the
target of the compound (i.e. a nonspecific strain).

Figure 5 illustrates the results of a hybridization analysis where the antisense nucleic acid
expressed by a strain in the culture is complementary to all or a portion of the gene encoding the
target of the compound; the hybridization intensity for that strain will be intimately correlated with
the concentration of the compound (i.e. a specific strain).

Figure 6 illustrates an oligonucleotide comprising a lac operator flanked on each side by 40
nucleotides homologous to the promoter which drives expression of the yabB yabC
ftsL ftsI murE genes in an operon, for use in inserting the lac operator into the promoter.

Figure 7 is an IPTG dose response curve in E. coli transformed with an IPTG-inducible
plasmid containing either an antisense clone to the E. coli ribosomal protein rplW (AS-rplW) which
is required for protein synthesis and essential for cell proliferation, or an antisense clone to the elaD
(AS-elaD) gene which is not known to be involved in protein synthesis and which is also essential
for proliferation.

Figure 8A is a tetracycline dose response curve in E. coli transformed with an IPTG- 
inducible plasmid containing antisense to rplW (AS-rplW) in the absence (0) or presence of IPTG at 
concentrations that result in 20% and 50% growth inhibition. 

Figure 8B is a tetracycline dose response curve in E. coli transformed with an IPTG-
inducible plasmid containing antisense to elaD (AS-elaD) in the absence (0) or presence of IPTG at
concentrations that result in 20% and 50% growth inhibition.

Figure 9 is a graph showing the fold increase in tetracycline sensitivity of E. coli
transfected with antisense clones to essential ribosomal proteins L23 (AS-rplW) and L7/L12 and
L10 (AS-rplLrplJ). Antisense clones to genes known to not be directly involved in protein
synthesis, atpB/E (AS-atpB/E), visC (AS-visC), elaD (AS-elaD), yohH (AS-yohH), are much less
sensitive to tetracycline.

Figure 10 illustrates the results of an assay in which Staphylococcus aureus cells
transcribing an antisense nucleic acid complementary to the gyrB gene encoding the β subunit of
gyrase were contacted with several antibiotics whose targets were known.

Figure 11 illustrates a microtitration plate which contains antibiotic and inducer at gradient
concentrations in a matrix format in 10 times excess quantity.

Figure 12 illustrates the results of an experiment demonstrating that at appropriate
concentrations of inducer, cells which overexpress the defB gene product were able to grow at
elevated concentrations of the antibiotic actinonin.

Figure 13 illustrates the results of an experiment demonstrating that at appropriate
concentrations of inducer, cells which overexpress the folA gene product were able to grow at
elevated concentrations of the antibiotic trimethoprim.

Figure 14 illustrates the results of an experiment demonstrating that overexpression of the
fabI gene confers resistance to triclosan, which acts on the gene product of the fabI gene, but does
not confer resistance to cerulenin, trimethoprim, or actinonin, each of which act on other gene
products.

Figure 15 illustrates the results of an experiment demonstrating that overexpression of the
folA gene confers resistance to trimethoprim, which acts on the gene product of the folA gene, but
does not confer resistance to triclosan, cerulenin, or actinonin, each of which act on other gene
products.

Figure 16 illustrates the results of an experiment demonstrating that overexpression of the
defB gene conferred resistance to actinonin, which acts on the gene product of the defB gene, but
does not confer resistance to cerulenin, trimethoprim, or triclosan, each of which act on other gene
products.

Figure 17 illustrates the results of an experiment demonstrating that overexpression of the
fabF gene conferred resistance to cerulenin, which acts on the gene product of the fabF gene, β
keto-acyl carrier protein synthase, but does not confer resistance to triclosan, trimethoprim, or
actinonin, each of which act on other gene products.

Figure 18 illustrates the results of experiments in which a mixture of nine strains was
grown in wells of a 96 well plate in medium containing various concentrations of inducer and a
sufficient concentration of actinonin, cerulenin, triclosan or trimethoprim to inhibit the growth of
strains which do not overexpress the targets of these antibiotics.

Detailed Description of Embodiments of the Invention 
The present invention describes a group of prokaryotic genes and gene families required for
cellular proliferation. Exemplary genes and gene families from Escherichia coli, Staphylococcus
aureus, Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella
typhimurium, Acinetobacter baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella
pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia
mallei, Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium
acetobutylicum, Clostridium botulinum, Clostridium difficile, Corynebacterium diphtheriae,
Enterobacter cloacae, Enterococcus faecium, Haemophilus influenzae, Helicobacter pylori,
Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium,
Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella
multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi,
Salmonella typhi, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
mutans, Streptococcus pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae and Yersinia pestis are provided. A proliferation-required gene or gene
family is one where, in the absence or substantial reduction of a gene transcript and/or gene product,
growth or viability of the cell or microorganism is reduced or eliminated. Thus, as used herein, the
terminology "proliferation-required" or "required for proliferation" encompasses instances where the
absence or substantial reduction of a gene transcript and/or gene product completely eliminates cell
growth as well as instances where the absence of a gene transcript and/or gene product merely reduces
cell growth. These proliferation-required genes can be used as potential targets for the generation of
new antimicrobial agents. To achieve that goal, the present invention also encompasses assays for
analyzing proliferation-required genes and for identifying compounds which interact with the gene
and/or gene products of the proliferation-required genes. In addition, the present invention
contemplates the expression of genes and the purification of the proteins encoded by the nucleic acid
sequences identified as required proliferation genes and reported herein. The purified proteins can be
used to generate reagents and screen small molecule libraries or other candidate compound libraries for
compounds that can be further developed to yield novel antimicrobial compounds.

The present invention also describes methods for identification of nucleotide sequences
homologous to these genes and polypeptides described herein, including nucleic acids comprising
nucleotide sequences homologous to the nucleic acids of SEQ ID NOS.: 6214-42397 and
polypeptides homologous to the polypeptides of SEQ ID NOs.: 42398-78581. For example, these
sequences may be used to identify homologous coding nucleic acids, homologous antisense nucleic
acids, or homologous polypeptides in microorganisms such as Acinetobacter baumannii, Anaplasma
marginale, Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis,
Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei,
Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata),
Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr
(also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia
trachomatis, Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium
perfringens, Coccidioides immitis, Corynebacterium diphtheriae, Cryptococcus neoformans,
Enterobacter cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli,
Haemophilus influenzae, Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae,
Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium,
Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia
asteroides, Pasteurella haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus
mirabilis, Proteus vulgaris, Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas
syringae, Salmonella bongori, Salmonella choleraesuis, Salmonella enterica, Salmonella paratyphi,
Salmonella typhi, Salmonella typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri,
Shigella sonnei, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus,
Streptococcus pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum,
Ureaplasma urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia
enterocolitica, Yersinia pestis or any species falling within the genera of any of the above species.
In some embodiments, the homologous coding nucleic acids, homologous antisense nucleic acids, or
homologous polypeptides are identified in an organism other than E. coli.

The homologous coding nucleic acids, homologous antisense nucleic acids, or homologous
polypeptides may then be used in each of the methods described herein, including methods of
identifying compounds which inhibit the proliferation of the organism containing the homologous
coding nucleic acid, homologous antisense nucleic acid or homologous polypeptide, methods of
inhibiting the growth of the organism containing the homologous coding nucleic acid, homologous
antisense nucleic acid or homologous polypeptide, methods of identifying compounds which
influence the activity or level of a gene product required for proliferation of the organism
containing the homologous coding nucleic acid, homologous antisense nucleic acid or homologous
polypeptide, methods for identifying compounds or nucleic acids having the ability to reduce the
level or activity of a gene product required for proliferation of the organism containing the
homologous coding nucleic acid, homologous antisense nucleic acid or homologous polypeptide,
methods of inhibiting the activity or expression of a gene in an operon required for proliferation of
the organism containing the homologous coding nucleic acid, homologous antisense nucleic acid or
homologous polypeptide, methods for identifying a gene required for proliferation of the organism
containing the homologous coding nucleic acid, homologous antisense nucleic acid or homologous
polypeptide, methods for identifying the biological pathway in which a gene or gene product
required for proliferation of the organism containing the homologous coding nucleic acid,
homologous antisense nucleic acid or homologous polypeptide lies, methods for identifying
compounds having activity against a biological pathway required for proliferation of the organism
containing the homologous coding nucleic acid, homologous antisense nucleic acid or homologous
polypeptide, methods for determining the biological pathway on which a test compound acts in the
organism containing the homologous coding nucleic acid, homologous antisense nucleic acid or
homologous polypeptide, methods of replacing an endogenous promoter with a regulatable
promoter which controls the expression of the homologous coding nucleic acid, homologous
antisense nucleic acid or homologous polypeptide, methods of inserting an operator within or near
an endogenous promoter to provide regulatable expression of the homologous coding nucleic acid,
homologous antisense nucleic acid or homologous polypeptide, methods of identifying the target on
which a compound acts in the organism containing the homologous coding nucleic acid,
homologous antisense nucleic acid or homologous polypeptide, and methods of inhibiting the
proliferation of the organism containing the homologous coding nucleic acid, homologous antisense
nucleic acid or homologous polypeptide in a subject. In some embodiments of the present
invention, the methods are performed using an organism other than E. coli or a gene or gene
product from an organism other than E. coli.

One embodiment of the present invention utilizes a novel method to identify
proliferation-required sequences. Generally, a library of nucleic acid sequences from a given source is subcloned
or otherwise inserted immediately downstream of an inducible promoter on an appropriate vector, such
as a Staphylococcus aureus/E. coli or Pseudomonas aeruginosa/E. coli shuttle vector, or a vector
which will replicate in both Salmonella typhimurium and Klebsiella pneumoniae, or other vector or
shuttle vector capable of functioning in the intended organism, thus forming an expression library. It is
generally preferred that expression is directed by a regulatable promoter sequence such that expression
level can be adjusted by addition of variable concentrations of an inducer molecule or of an inhibitor
molecule to the medium. For example, a number of regulatable promoters useful for regulating the
expression of nucleic acid sequences over a wide range of expression levels are described in U.S.
Patent Application Serial Number 10/032,393, filed December 21, 2001. Temperature activated
promoters, such as promoters regulated by temperature sensitive repressors, such as the lambda cI857
repressor, are also envisioned. Although the insert nucleic acids may be derived from the chromosome
of the cell or microorganism into which the expression vector is to be introduced, because the insert is
not in its natural chromosomal location, the insert nucleic acid is an exogenous nucleic acid for the
purposes of the discussion herein. The term "expression" is defined as the production of a sense or
antisense RNA molecule from a gene, gene fragment, genomic fragment, chromosome, operon or
portion thereof. Expression can also be used to refer to the process of peptide or polypeptide synthesis.
An expression vector is defined as a vehicle by which a ribonucleic acid (RNA) sequence is transcribed
from a nucleic acid sequence carried within the expression vehicle. The expression vector can also
contain features that permit translation of a protein product from the transcribed RNA message
expressed from the exogenous nucleic acid sequence carried by the expression vector. Accordingly, an
expression vector can produce an RNA molecule as its sole product or the expression vector can
produce an RNA molecule that is ultimately translated into a protein product.

Once generated, the expression library containing the exogenous nucleic acid sequences is 
introduced into a population of cells (such as the organism from which the exogenous nucleic acid 
sequences were obtained) to search for genes that are required for bacterial proliferation. Because the 
library molecules are foreign, in context, to the population of cells, the expression vectors and the
nucleic acid segments contained therein are considered exogenous nucleic acid. 

Expression of the exogenous nucleic acid fragments in the test population of cells containing
the expression library is then activated. Activation of the expression vectors consists of subjecting the
cells containing the vectors to conditions that result in the expression of the exogenous nucleic acid
sequences carried by the expression library. The test population of cells is then assayed to determine
the effect of expressing the exogenous nucleic acid fragments on the test population of cells. Those 
expression vectors that negatively impact the growth of the cells upon induction of expression of the 
random sequences contained therein are identified, isolated, and purified for further study. 

In some embodiments, vectors which comprise a regulatable fusion promoter selected
from a suite of fusion promoters, wherein the promoter suite is useful for modulating both the basal
and maximal levels of transcription of a nucleic acid over a wide dynamic range, thus allowing the
desired level of production of a transcript, can be used to express exogenous nucleic acids,
including the nucleic acids of the present invention. Such promoters are described in U.S. Patent
Application Serial Number 10/032,393, filed December 21, 2001, the disclosure of which is
incorporated herein by reference in its entirety.

In some other embodiments, vectors useful for the production of stabilized mRNA having
an increased lifetime (including antisense RNA) in Gram-negative organisms may be used. Such
vectors are described in U.S. Provisional Patent Application Serial Number 60/343,512, filed
December 21, 2001. Briefly, the stabilized antisense RNA may comprise an antisense RNA which
was identified as inhibiting proliferation as described above and which has been engineered to
contain at least one stem loop flanking each end of the antisense nucleic acid. In some embodiments,
the at least one stem-loop structure formed at the 5' end of the stabilized antisense nucleic acid
comprises a flush, double stranded 5' end. In some embodiments, one or more of the stem loops
comprises a rho independent terminator. In additional embodiments, the stabilized antisense RNA
lacks a ribosome binding site. In further embodiments, the stabilized RNA lacks sites which are
cleaved by one or more RNAses, such as RNAse E or RNAse III. In some embodiments, the
stabilized antisense RNA may be transcribed in a cell in which the activity of at least one enzyme
involved in RNA degradation has been reduced. For example, the activity of an enzyme such as
RNase E, RNase II, RNase III, polynucleotide phosphorylase, poly(A) polymerase, RNA helicase,
enolase or an enzyme having similar functions may be reduced in the cell.

Alternatively, genes required for proliferation may be identified by replacing the natural 
promoter for the proliferation required gene with a regulatable promoter as described above. The
growth of such a strain under conditions in which the promoter is active or non-repressed is
compared to the growth under conditions in which the promoter is inactive or repressed. If the
strain fails to grow or grows at a substantially reduced rate under conditions in which the promoter is
inactive or repressed but grows normally under conditions in which the promoter is active or
non-repressed, then the gene which is operably linked to the regulatable promoter encodes a gene
product required for proliferation. For example, proliferation-required genes and gene products
identified using promoter replacement are described in U.S. Patent Application Serial Number 
09/948,993. 

For example, in some embodiments, the natural promoter may be replaced using techniques 
which employ homologous recombination to exchange a promoter present on the chromosome of
the cell with the desired promoter. In such methodology, a nucleic acid comprising a promoter
replacement cassette is introduced into the cell. As illustrated in Figure 1A, the promoter
replacement cassette comprises a 5' region homologous to the sequence which is 5' of the natural
promoter in the chromosome, the promoter which is to replace the chromosomal promoter, and a 3'
region which is homologous to sequences 3' of the natural promoter in the chromosome. In some
embodiments, the promoter replacement cassette may also include a nucleic acid encoding an
identifiable or selectable marker disposed between the 5' region which is homologous to the
sequence 5' of the natural promoter and the promoter which is to replace the chromosomal
promoter. If desired, the promoter replacement cassette may also contain a transcriptional
terminator 3' of the gene encoding an identifiable or selectable marker, as illustrated in Figure 1B.
As illustrated in Figures 1A and 1B, homologous recombination is allowed to occur between the
chromosomal region containing the natural promoter and the promoter replacement cassette. Cells 
in which the promoter replacement cassette has integrated into the chromosome are identified or 
selected. To confirm that homologous recombination has occurred, the chromosomal structure of 
the cells may be verified by Southern analysis or PCR. 
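
The order of elements in the promoter replacement cassette described above can be illustrated with a short Python sketch. The function name `build_promoter_replacement_cassette`, its parameter names, and any sequences passed to it are hypothetical and serve only to show the 5' to 3' layout; they are not sequences or software of the present disclosure.

```python
def build_promoter_replacement_cassette(upstream_arm, promoter, downstream_arm,
                                        marker="", terminator=""):
    """Concatenate cassette elements in 5'->3' order (illustrative only).

    Layout: 5' homology arm, optional marker gene, optional transcriptional
    terminator (placed 3' of the marker), replacement promoter, 3' homology arm.
    """
    if terminator and not marker:
        # In the layout described in the text, the terminator sits 3' of the marker gene.
        raise ValueError("a terminator is only placed 3' of a marker gene")
    parts = [upstream_arm, marker, terminator, promoter, downstream_arm]
    return "".join(parts)
```

Omitting `marker` and `terminator` gives the minimal arrangement of Figure 1A; supplying both gives the arrangement of Figure 1B.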

In some embodiments, the promoter replacement cassette may be introduced into the cell as
a linear nucleic acid, such as a PCR product or a restriction fragment. Alternatively, the promoter
replacement cassette may be introduced into the cell on a plasmid. Figures 1A and 1B illustrate the
replacement of a chromosomal promoter with a desired promoter through homologous
recombination.

In some embodiments, the cell into which the promoter replacement cassette is introduced
may carry mutations which enhance its ability to be transformed with linear DNA or which enhance
the frequency of homologous recombination. For example, if the cell is an Escherichia coli cell it
may have a mutation in the gene encoding Exonuclease V of the RecBCD recombination complex.
If the cell is an Escherichia coli cell it may have a mutation that activates the RecET recombinase
of the Rac prophage and/or a mutation that enhances recombination through the RecF pathway. For
example, the Escherichia coli cells may be RecB or RecC mutants carrying an sbcA or sbcB
mutation. Alternatively, the Escherichia coli cells may be recD mutants. In other embodiments the
Escherichia coli cells may express the λ Red recombination genes. For example, Escherichia coli
cells suitable for use in techniques employing homologous recombination have been described in
Datsenko, K.A. and Wanner, B.L., PNAS 97:6640-6645 (2000); Murphy, K.C., J. Bact. 180: 2053-2071
(1998); Zhang, Y., et al., Nature Genetics 20: 123-128 (1998); and Muyrers, J.P.P. et al.,
Genes & Development 14: 1971-1982 (2000). It will be appreciated that cells carrying mutations in
similar genes may be constructed in organisms other than Escherichia coli.

In some embodiments of the present invention, a regulatable fusion promoter selected from
a suite of fusion promoters, wherein the promoter suite is useful for modulating both the basal and
maximal levels of transcription of a nucleic acid over a wide dynamic range, thus allowing the
desired level of production of a transcript, is used with the promoter replacement methods described
above. Such promoters are described in U.S. Patent Application Serial Number 10/032,393, filed
December 21, 2001, the disclosure of which is incorporated herein by reference in its entirety.

A variety of assays are contemplated to identify nucleic acid sequences that negatively impact
growth upon expression. In one embodiment, growth in cultures expressing exogenous nucleic acid
sequences and growth in cultures not expressing these sequences are compared. Growth is assayed
by measuring optical densities. Alternatively,
enzymatic assays can be used to measure bacterial growth rates to identify exogenous nucleic acid
sequences of interest. Colony size, colony morphology, and cell morphology are additional factors
used to evaluate growth of the host cells. Those cultures that fail to grow or grow at a reduced rate
under expression conditions are identified as containing an expression vector encoding a nucleic acid
fragment that negatively affects a proliferation-required gene.
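
The optical density comparison described above may be sketched as follows. This is a minimal illustration only: the function name `classify_clones`, the dictionary inputs, and the cutoff `max_ratio=0.5` are assumptions chosen for the example and are not values taken from the present disclosure.

```python
def classify_clones(od_uninduced, od_induced, max_ratio=0.5):
    """Return clone IDs whose growth is reduced upon induction.

    od_uninduced / od_induced: dicts mapping clone ID -> optical density
    measured without and with induction of the exogenous nucleic acid.
    max_ratio: hypothetical cutoff on the induced/uninduced OD ratio.
    """
    hits = []
    for clone, od_off in od_uninduced.items():
        od_on = od_induced[clone]
        if od_off <= 0:
            # No measurable growth even without induction; cannot be scored.
            continue
        if od_on / od_off <= max_ratio:
            hits.append(clone)
    return sorted(hits)
```

In practice the cutoff would be chosen empirically, for example from the spread of ratios observed for control vectors lacking an insert.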

Once exogenous nucleic acids of interest are identified, they are analyzed. The first step of the 
analysis is to acquire the nucleotide sequence of the nucleic acid fragment of interest. To achieve this 
end, the insert in those expression vectors identified as containing a nucleotide sequence of interest is
sequenced, using standard techniques well known in the art. The next step of the process is to
determine the source of the nucleotide sequence. As used herein "source" means the genomic region
containing the cloned fragment.

Determination of the gene(s) corresponding to the nucleotide sequence is achieved by
comparing the obtained sequence data with databases containing known protein and nucleotide
sequences from various microorganisms. Thus, initial gene identification is made on the basis of
significant sequence similarity or identity to either characterized or predicted Escherichia coli,
Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa,
and Salmonella typhimurium genes or their encoded proteins and/or homologues in other species.

The number of nucleotide and protein sequences available in database systems has been
growing exponentially for years. For example, the complete nucleotide sequences of Caenorhabditis
elegans and several bacterial genomes, including E. coli, Aeropyrum pernix, Aquifex aeolicus,
Archaeoglobus fulgidus, Bacillus subtilis, Borrelia burgdorferi, Chlamydia pneumoniae,
Chlamydia trachomatis, Clostridium tetani, Corynebacterium diphtheriae, Deinococcus radiodurans,
Haemophilus influenzae, Helicobacter pylori 26695, Helicobacter pylori J99, Methanobacterium
thermoautotrophicum, Methanococcus jannaschii, Mycobacterium tuberculosis, Mycoplasma
genitalium, Mycoplasma pneumoniae, Pseudomonas aeruginosa, Pyrococcus abyssi, Pyrococcus
horikoshii, Rickettsia prowazekii, Synechocystis PCC6803, Thermotoga maritima, Treponema
pallidum, Bordetella pertussis, Campylobacter jejuni, Clostridium acetobutylicum, Mycobacterium
tuberculosis CSU#93, Neisseria gonorrhoeae, Neisseria meningitidis, Pseudomonas aeruginosa,
Pyrobaculum aerophilum, Pyrococcus furiosus, Rhodobacter capsulatus, Salmonella typhimurium,
Streptococcus mutans, Streptococcus pyogenes, Ureaplasma urealyticum and Vibrio cholerae are
available. This nucleotide sequence information is stored in a number of databanks, such as GenBank,
the National Center for Biotechnology Information (NCBI), the Genome Sequencing Center
(http://genome.wustl.edu/gsc/salmonella.shtml), and the Sanger Centre
(http://www.sanger.ac.uk/projects/S_typhi), which are publicly available for searching. A variety
of computer programs are available to assist in the analysis of the sequences stored within these
databases. FASTA (W. R. Pearson (1990) "Rapid and Sensitive Sequence Comparison with
FASTP and FASTA" Methods in Enzymology 183:63-98) and the Sequence Retrieval System (SRS)
(Etzold & Argos, SRS an indexing and retrieval tool for flat file data libraries. Comput. Appl.
Biosci. 9:49-57, 1993) are two examples of computer programs that can be used to analyze
sequences of interest. In one embodiment of the present invention, the BLAST family of computer
programs, which includes BLASTN version 2.0 with the default parameters, or BLASTX version
2.0 with the default parameters, is used to analyze nucleotide sequences.

BLAST, an acronym for "Basic Local Alignment Search Tool," is a family of programs for
database similarity searching. The BLAST family of programs includes: BLASTN, a nucleotide
sequence database searching program; BLASTX, a protein database searching program where the input
is a nucleic acid sequence; and BLASTP, a protein database searching program. BLAST programs
embody a fast algorithm for sequence matching, rigorous statistical methods for judging the
significance of matches, and various options for tailoring the program for special situations. Assistance
in using the program can be obtained by e-mail at blast@ncbi.nlm.nih.gov. tBLASTX can be used to
translate a nucleotide sequence in all three potential reading frames into an amino acid sequence.
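
Translation of a nucleotide sequence in its three reading frames can be illustrated with the following Python sketch. It is not the tBLASTX implementation; the function name `translate_frames` is hypothetical, and the sketch assumes the standard genetic code and translates only the strand supplied (the complementary strand would provide three further frames).

```python
# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

def translate_frames(seq):
    """Translate a DNA sequence in reading frames 0, 1 and 2.

    Stop codons are written as '*'; codons containing characters
    other than A, C, G or T are written as 'X'.
    """
    seq = seq.upper()
    frames = []
    for offset in range(3):
        codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
        frames.append("".join(CODON_TABLE.get(codon, "X") for codon in codons))
    return frames
```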

Bacterial genes are often transcribed in polycistronic groups. These groups comprise operons,
which are collections of genes and intergenic sequences under common regulation. The genes of an
operon are transcribed on the same mRNA and are often related functionally. Given the nature of the
screening protocol, it is possible that the identified exogenous nucleic acid corresponds to a gene or
portion thereof with or without adjacent noncoding sequences, an intragenic sequence (i.e. a sequence
within a gene), an intergenic sequence (i.e. a sequence between genes), a nucleotide sequence spanning
at least a portion of two or more genes, a 5' noncoding region or a 3' noncoding region located
upstream or downstream from the actual nucleotide sequence that is required for bacterial proliferation.
Accordingly, it is often desirable to determine which gene(s) encoded within the operon are
individually required for proliferation.

In one embodiment of the present invention, an operon is identified and then dissected to
determine which gene or genes are required for proliferation. Operons can be identified by a
variety of means known to those in the art. For example, the RegulonDB DataBase described by
Huerta et al. (Nucl. Acids Res. 26:55-59, 1998), which may also be found on the website
http://www.cifn.unam.mx/Computational_Biology/regulondb/, provides information about operons
in Escherichia coli. The SubtiList database (http://bioweb.pasteur.fr/GenoList/SubtiList) (Moszer,
I., Glaser, P. and Danchin, A. (1995) Microbiology 141: 261-268 and Moszer, I. (1998) FEBS
Letters 430: 28-36) may also be used to predict operons. This database lists genes from the fully
sequenced Gram-positive bacterium Bacillus subtilis, together with predicted promoters and
terminator sites. This information can be used in conjunction with the Staphylococcus aureus
genomic sequence data to predict operons and thus produce a list of the genes affected by the
antisense nucleic acids of the present invention. The Pseudomonas aeruginosa web site
(http://www.pseudomonas.com) can be used to help predict operon organization in this bacterium.
The databases available from the Genome Sequencing Center
(http://genome.wustl.edu/gsc/salmonella.shtml) and the Sanger Centre
(http://www.sanger.ac.uk/projects/S_typhi) may be used to predict operons in Salmonella
typhimurium. The TIGR microbial database has an incomplete version of the E. faecalis genome
(http://www.tigr.org/cgi-bin/BlastSearch/blast.cgi?organism=e_faecalis). One can take a nucleotide
sequence and BLAST it for homologs.
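
One simple way in which gene coordinates and predicted terminator sites might be combined to propose candidate operons is sketched below. This heuristic is an assumption made for illustration and is not the method used by any of the databases named above: adjacent genes are grouped when they lie on the same strand, are separated by no more than `max_gap` nucleotides (a hypothetical default of 50), and have no predicted terminator between them. The function name `predict_operons` and the tuple layout are likewise hypothetical.

```python
def predict_operons(genes, terminators=(), max_gap=50):
    """Group adjacent same-strand genes into candidate operons.

    genes: iterable of (name, start, end, strand) tuples.
    terminators: positions of predicted transcriptional terminators.
    Returns a list of lists of gene names, ordered by start coordinate.
    """
    operons = []
    for gene in sorted(genes, key=lambda g: g[1]):
        name, start, end, strand = gene
        if operons:
            prev = operons[-1][-1]
            gap = start - prev[2]
            # A terminator lying between the two genes breaks the transcript.
            split = any(prev[2] <= t <= start for t in terminators)
            if prev[3] == strand and gap <= max_gap and not split:
                operons[-1].append(gene)
                continue
        operons.append([gene])
    return [[g[0] for g in operon] for operon in operons]
```

The resulting gene lists indicate which genes could be affected together by an antisense nucleic acid directed against one member of the group; each candidate operon would still need to be confirmed experimentally.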

A number of techniques that are well known in the art can be used to dissect the operon.
Analysis of RNA transcripts by Northern blot or primer extension techniques is commonly used to
analyze operon transcripts. In one aspect of this embodiment, gene disruption by homologous
recombination is used to individually inactivate the genes of an operon that is thought to contain a gene
required for proliferation.

Several gene disruption techniques have been described for the replacement of a functional
gene with a mutated, non-functional (null) allele. These techniques generally involve the use of
homologous recombination. One technique using homologous recombination in Staphylococcus
aureus is described in Xia et al., 1999, Plasmid 42: 144-149. This technique uses crossover PCR to
create a null allele with an in-frame deletion of the coding region of a target gene. The null allele is
constructed in such a way that nucleotide sequences adjacent to the wild type gene are retained.
These homologous sequences surrounding the deletion null allele provide targets for homologous
recombination so that the wild type gene on the Staphylococcus aureus chromosome can be
replaced by the constructed null allele. This method can be used with other bacteria as well,
including Salmonella and Klebsiella species. Similar gene disruption methods that employ the
counter selectable marker sacB (Schweizer, H. P., Klassen, T. and Hoang, T. (1996) Mol. Biol. of
Pseudomonas. ASM press, 229-237) are available for Pseudomonas, Salmonella and Klebsiella
species. E. faecalis genes can be disrupted by recombining in a non-replicating plasmid that
contains an internal fragment of that gene (Leboeuf, C., L. Leblanc, Y. Auffray and A. Hartke.
2000. J. Bacteriol. 182:5799-5806).

The crossover PCR amplification product is subcloned into a suitable vector having a
selectable marker, such as a drug resistance marker. In some embodiments the vector may have an
origin of replication which is functional in E. coli or another organism distinct from the organism in
which homologous recombination is to occur, allowing the plasmid to be grown in E. coli or the
organism other than that in which homologous recombination is to occur, but may lack an origin of
replication functional in Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella
pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter baumannii,
Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia
cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Chlamydia
pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum,
Clostridium difficile, Corynebacterium diphtheriae, Enterobacter cloacae, Enterococcus faecium,
Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria monocytogenes,
Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae,
Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria
gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus mirabilis, Pseudomonas
putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi, Staphylococcus
epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus pneumoniae,
Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio cholerae or
Yersinia pestis such that selection of the selectable marker requires integration of the vector into the
homologous region of the Escherichia coli, Staphylococcus aureus, Enterococcus faecalis,
Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter
baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi,
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni,
Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium
botulinum, Clostridium difficile, Corynebacterium diphtheriae, Enterobacter cloacae, Enterococcus
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi,
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio
cholerae or Yersinia pestis chromosome. Usually a single crossover event is responsible for this
integration event such that the Escherichia coli, Staphylococcus aureus, Enterococcus faecalis,
Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter
baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi,
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni,
Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium
botulinum, Clostridium difficile, Corynebacterium diphtheriae, Enterobacter cloacae, Enterococcus
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi,
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio
cholerae or Yersinia pestis chromosome now contains a tandem duplication of the target gene
consisting of one wild type allele and one deletion null allele separated by vector sequence.
Subsequent resolution of the duplication results in both removal of the vector sequence and either
restoration of the wild type gene or replacement by the in-frame deletion. The latter outcome will
not occur if the gene should prove essential. A more detailed description of this method is provided
in Example 10 below. It will be appreciated that this method may be practiced with any of the
nucleic acids or organisms described herein.
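
The design of a null allele carrying an in-frame deletion, with the sequences adjacent to the wild type gene retained as targets for homologous recombination, can be illustrated in silico as follows. This sketch is not the crossover PCR procedure itself. The function name `make_in_frame_deletion` is hypothetical, and retaining the first and last `keep_codons` codons of the coding region is an assumption made for the example.

```python
def make_in_frame_deletion(upstream_flank, coding_seq, downstream_flank,
                           keep_codons=2):
    """Return a null allele: flanks plus a coding remnant deleted in frame.

    The remnant keeps `keep_codons` codons from each end of coding_seq, so
    the number of deleted nucleotides is a multiple of three and the
    reading frame of the remnant is preserved.
    """
    if len(coding_seq) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    keep = 3 * keep_codons
    if len(coding_seq) <= 2 * keep:
        raise ValueError("coding sequence too short for the requested deletion")
    remnant = coding_seq[:keep] + coding_seq[-keep:]
    deleted = len(coding_seq) - len(remnant)
    assert deleted % 3 == 0  # in-frame by construction
    return upstream_flank + remnant + downstream_flank
```

Keeping the deletion in frame is intended to limit polar effects on downstream genes of the same operon, which matters when the genes of an operon are being inactivated one at a time.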

Recombinant DNA techniques can be used to express the entire coding sequences of the gene
identified as required for proliferation, or portions thereof. The over-expressed proteins can be used as
reagents for further study. The identified exogenous sequences are isolated, purified, and cloned into a 
suitable expression vector using methods well known in the art. If desired, the nucleic acids can 
contain the nucleotide sequences encoding a signal peptide to facilitate secretion of the expressed 
protein. 

Expression of fragments of the bacterial genes identified as required for proliferation is also
contemplated by the present invention. The fragments of the identified genes can encode a polypeptide
comprising at least 5, at least 10, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at
least 45, at least 50, at least 55, at least 60, at least 65, at least 75, or more than 75 consecutive amino
acids of a gene complementary to one of the identified sequences of the present invention. The nucleic
acids inserted into the expression vectors can also contain endogenous sequences upstream and
downstream of the coding sequence.

When expressing the encoded protein of the identified nucleic acid required for bacterial
proliferation or a fragment thereof, the nucleic acid to be expressed is operably linked to a promoter in
an expression vector using conventional cloning technology. The expression vector can be any of the
bacterial, insect, yeast, or mammalian expression systems known in the art. Vectors and expression
systems are commercially available from a variety of suppliers including Genetics Institute
(Cambridge, MA), Stratagene (La Jolla, California), Promega (Madison, Wisconsin), and Invitrogen
(San Diego, California). If desired, to enhance expression and facilitate proper protein folding, the
codon usage and codon bias of the sequence can be optimized for the particular expression organism
into which the expression vector is introduced, as explained by Hatfield, et al., U.S. Patent No. 5,082,767,
incorporated herein by this reference. Fusion protein expression systems are also contemplated by the
present invention.

Following expression of the protein encoded by the identified exogenous nucleic acid, the
protein may be purified. Protein purification techniques are well known in the art. Proteins encoded
and expressed from identified exogenous nucleic acids can be partially purified using precipitation
techniques, such as precipitation with polyethylene glycol. Alternatively, epitope tagging of the protein
can be used to allow simple one step purification of the protein. In addition, chromatographic methods
such as ion-exchange chromatography, gel filtration, use of hydroxyapatite columns, immobilized
reactive dyes, chromatofocusing, and use of high-performance liquid chromatography, may also be
used to purify the protein. Electrophoretic methods such as one-dimensional gel electrophoresis,
high-resolution two-dimensional polyacrylamide electrophoresis, isoelectric focusing, and others are
contemplated as purification methods. Also, affinity chromatographic methods, comprising antibody
columns, ligand presenting columns and other affinity chromatographic matrices are contemplated as
purification methods in the present invention.

The purified proteins produced from the gene encoding sequences identified as required for
proliferation can be used in a variety of protocols to generate useful antimicrobial reagents. In one
embodiment of the present invention, antibodies are generated against the proteins expressed from the
identified exogenous nucleic acids. Both monoclonal and polyclonal antibodies can be generated
against the expressed proteins. Methods for generating monoclonal and polyclonal antibodies are well
known in the art. Also, antibody fragment preparations prepared from the produced antibodies
discussed above are contemplated.

In addition, the purified protein, fragments thereof, or derivatives thereof may be administered
to an individual in a pharmaceutically acceptable carrier to induce an immune response against the
protein. Preferably, the immune response is a protective immune response which protects the
individual. Appropriate dosages of the protein and pharmaceutically acceptable carriers may be
determined empirically by methods familiar to those skilled in the art.

Another application for the purified proteins of the present invention is to screen small
molecule libraries for candidate compounds active against the various target proteins of the present
invention. Advances in the field of combinatorial chemistry provide methods, well known in the art, to
produce large numbers of candidate compounds that can have a binding, or otherwise inhibitory effect
on a target protein. Accordingly, the screening of small molecule libraries for compounds with binding
affinity or inhibitory activity for a target protein produced from an identified gene is contemplated by
the present invention.

In some embodiments of the present invention, a cell sensitized by expressing an antisense
nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.:
1-6213, an antisense nucleic acid comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200,
300, 400, or 500 consecutive nucleotides of a nucleotide sequence selected from the group
consisting of SEQ ID NOs.: 1-6213, a nucleic acid complementary to a nucleic acid comprising a
nucleotide sequence selected from the group consisting of SEQ ID NOs.: 6214-42397, a nucleic
acid complementary to a nucleic acid comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150,
200, 300, 400, or 500 consecutive nucleotides of a nucleotide sequence selected from the group
consisting of SEQ ID NOs.: 6214-42397, a nucleic acid complementary to a nucleic acid which
encodes a polypeptide comprising an amino acid sequence selected from the group consisting of
SEQ ID NOs.: 42398-78581, a nucleic acid complementary to a nucleic acid which encodes at least
5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids of a polypeptide sequence
selected from the group consisting of SEQ ID NOs.: 42398-78581, a homologous antisense nucleic
acid, an antisense nucleic acid comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200,
300, 400, or 500 consecutive nucleotides of a homologous nucleic acid, a nucleic acid
complementary to a homologous coding nucleic acid, a nucleic acid complementary to at least 10,
15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides of a
homologous coding nucleic acid, a nucleic acid complementary to a nucleic acid which encodes a
homologous polypeptide, or a nucleic acid complementary to a nucleic acid which encodes at least
5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids of a homologous
polypeptide, is contacted with one or more candidate compounds from a small molecule library.
Candidate compounds which further inhibit the proliferation of the sensitized cell may be identified
as possessing inhibitory activity for a target protein or product produced by the gene to which the
antisense sequence is complementary.
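
Scoring of candidate compounds against a sensitized cell might be sketched as follows. The comparison against a non-sensitized control cell, the function name `sensitized_hits`, and the cutoff `min_shift=0.2` are assumptions made for this illustration and are not parameters specified in the present disclosure.

```python
def sensitized_hits(growth_sensitized, growth_control, min_shift=0.2):
    """Return compounds that inhibit sensitized cells more than control cells.

    growth_sensitized / growth_control: dicts mapping compound ID -> growth
    in the presence of the compound, as a fraction (0 to 1) of the growth
    of the same cells without compound.
    min_shift: hypothetical minimum difference required to call a hit.
    """
    hits = {}
    for compound, g_sens in growth_sensitized.items():
        g_ctrl = growth_control[compound]
        shift = g_ctrl - g_sens  # extra inhibition seen only when sensitized
        if shift >= min_shift:
            hits[compound] = round(shift, 3)
    return hits
```

A compound that inhibits sensitized and control cells to the same extent gives a shift near zero and is not reported, which helps distinguish target-related activity from general toxicity.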

A number of vectors useful in the above methods are described in U.S. Patent Application 
Serial Number 10/032,393, filed December 21, 2001. 

In some embodiments of the present invention, the methods for the production of stabilized
RNA, as described in U.S. Patent Application Serial Number 60/343,512, can be used for the
production of a stabilized transcript, which corresponds to a nucleic acid described herein, having
an increased lifetime in Gram-negative organisms. Briefly, the stabilized antisense RNA may
comprise an antisense RNA which was identified as inhibiting proliferation as described above
and which has been engineered to contain at least one stem loop flanking each end of the antisense
nucleic acid. In some embodiments, the at least one stem-loop structure formed at the 5' end of the
stabilized antisense nucleic acid comprises a flush, double stranded 5' end. In some embodiments,
one or more of the stem loops comprises a rho independent terminator. In additional embodiments,
the stabilized antisense RNA lacks a ribosome binding site. In further embodiments, the stabilized
RNA lacks sites which are cleaved by one or more RNAses, such as RNAse E or RNAse III. In
some embodiments, the stabilized antisense RNA may be transcribed in a cell in which the activity of
at least one enzyme involved in RNA degradation has been reduced. For example, the activity of
an enzyme such as RNase E, RNase II, RNase III, polynucleotide phosphorylase, poly(A)
polymerase, RNA helicase, enolase or an enzyme having similar functions may be reduced in the
cell.

The present invention further contemplates utility against a variety of other pathogenic 
microorganisms in addition to Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, 
Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacler 

15 baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, 
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, 
Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium 
botulinum, Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus 
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi, 
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio
cholerae and Yersinia pestis. For example, homologous coding nucleic acids, homologous antisense 
nucleic acids or homologous polypeptides from other pathogenic microorganisms (including nucleic 
acids homologous to the nucleic acids of SEQ ID NOs.: 6214-42397, nucleic acids homologous to 
the antisense nucleic acids of SEQ ID NOs.: 1-6213, and polypeptides homologous to the
polypeptides of SEQ ID NOs.: 42398-78581) may be identified using methods such as those
described herein. The homologous coding nucleic acids, homologous antisense nucleic acids or 
homologous polypeptides may be used to identify compounds which inhibit the proliferation of these 
other pathogenic microorganisms using methods such as those described herein. 

For example, the proliferation-required nucleic acids, antisense nucleic acids, and
polypeptides from Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella
pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter baumannii, 
Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia 
cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Chlamydia
pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum,
Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus faecium, 
Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria monocytogenes, 
Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae,
Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria
gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus mirabilis, Pseudomonas 
putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi, Staphylococcus 
epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus pneumoniae,
Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio cholerae or
Yersinia pestis described herein (including the nucleic acids of SEQ ID NOs.: 6214-42397, the
antisense nucleic acids of SEQ ID NOs.: 1-6213, and the polypeptides of SEQ ID NOs.: 42398-78581)
may be used to identify homologous coding nucleic acids, homologous antisense nucleic acids
or homologous polypeptides required for proliferation in prokaryotes and eukaryotes. For example,
nucleic acids or polypeptides required for the proliferation of protists, such as Plasmodium spp.;
plants; animals, such as Entamoeba spp. and Contracaecum spp.; and fungi including Candida spp.
(e.g., Candida albicans), Cryptococcus neoformans, and Aspergillus fumigatus may be identified. In
one embodiment of the present invention, monera, specifically bacteria, including both Gram positive
and Gram negative bacteria, are probed in search of novel gene sequences required for proliferation.
Likewise, homologous antisense nucleic acids which may be used to inhibit growth of these organisms
or to identify antibiotics may also be identified. These embodiments are particularly important given
the rise of drug resistant bacteria.

The number of bacterial species that are becoming resistant to existing antibiotics is growing. 
A partial list of these microorganisms includes: Escherichia spp., such as E. coli; Enterococcus spp.,
such as E. faecalis; Pseudomonas spp., such as P. aeruginosa; Clostridium spp., such as C.
botulinum; Haemophilus spp., such as H. influenzae; Enterobacter spp., such as E. cloacae; Vibrio
spp., such as V. cholerae; Moraxella spp., such as M. catarrhalis; Streptococcus spp., such as S.
pneumoniae; Neisseria spp., such as N. gonorrhoeae; Mycoplasma spp., such as Mycoplasma
pneumoniae; Salmonella typhimurium; Helicobacter pylori; Escherichia coli; and Mycobacterium
tuberculosis. The genes and polypeptides identified as required for the proliferation of Escherichia
coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas
aeruginosa, Salmonella typhimurium, Acinetobacter baumannii, Bacillus anthracis, Bacteroides 
fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum,
Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, 
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Corynebacterium
diptheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus influenzae, Helicobacter
pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium 
avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma 
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella
multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi,
Salmonella typhi, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
mutans, Streptococcus pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae or Yersinia pestis (including the nucleic acids of SEQ ID NOs.:
6214-42397, the sequences complementary to the nucleic acids of SEQ ID NOs.: 6214-42397, and the
polypeptides of SEQ ID NOs.: 42398-78581) can be used to identify homologous coding nucleic 
acids or homologous polypeptides required for proliferation from these and other organisms using 
methods such as nucleic acid hybridization and computer database analysis. Likewise, the 
antisense nucleic acids which inhibit proliferation of Escherichia coli, Staphylococcus aureus,
Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium,
Acinetobacter baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia
burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter
jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium 
botulinum, Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus 
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi,
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio
cholerae or Yersinia pestis (including the antisense nucleic acids of SEQ ID NOs.: 1-6213 or the
sequences complementary thereto) may also be used to identify antisense nucleic acids which
inhibit proliferation of these and other microorganisms or cells using nucleic acid hybridization or
computer database analysis.

In one embodiment of the present invention, the nucleic acid sequences from Escherichia coli, 
Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, 
Salmonella typhimurium, Acinetobacter baumannii, Bacillus anthracis, Bacteroides fragilis, 
Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum,
Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis,
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Corynebacterium 
diptheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus influenzae, Helicobacter 
pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium 
avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella
multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, 
Salmonella typhi, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
mutans, Streptococcus pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae or Yersinia pestis (including the nucleic acids of SEQ ID NOs.:
6214-42397 and the antisense nucleic acids of SEQ ID NOs.: 1-6213) are used to screen genomic libraries
generated from Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella
pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter baumannii,
Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia
cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Chlamydia 
pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum, 
Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus faecium,
Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria monocytogenes,
Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae,
Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria 
gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus mirabilis, Pseudomonas 
putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi, Staphylococcus 
epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus pneumoniae,
Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio cholerae, Yersinia
pestis and other bacterial species of interest. For example, the genomic library may be from Gram 
positive bacteria, Gram negative bacteria or other organisms including Acinetobacter baumannii, 
Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella 
pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia
mallei, Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis
glabrata), Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei,
Candida kefyr (also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia 
pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum, 
Clostridium difficile, Clostridium perfringens, Coccidioides immitis, Corynebacterium diptheriae,
Cryptococcus neoformans, Enterobacter cloacae, Enterococcus faecalis, Enterococcus faecium,
Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Histoplasma capsulatum, Klebsiella 
pneumoniae, Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, 
Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, 
Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis,
Nocardia asteroides, Pasteurella haemolytica, Pasteurella multocida, Pneumocystis carinii,
Proteus mirabilis, Proteus vulgaris, Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas 
syringae, Salmonella bongori, Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, 
Salmonella typhi, Salmonella typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, 
Shigella sonnei, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus,
Streptococcus pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum,
Ureaplasma urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia 
enterocolitica, Yersinia pestis or any species falling within the genera of any of the above species, 
including coagulase negative species of Staphylococcus. In some embodiments, the genomic
library may be from an organism other than E. coli. Standard molecular biology techniques are used
to generate genomic libraries from various cells or microorganisms. In one aspect, the libraries are 
generated and bound to nitrocellulose paper. The identified exogenous nucleic acid sequences of the 
present invention can then be used as probes to screen the libraries for homologous sequences.

For example, the libraries may be screened to identify homologous coding nucleic acids or
homologous antisense nucleic acids comprising nucleotide sequences which hybridize under
stringent conditions to a nucleic acid selected from the group consisting of SEQ ID NOs.: 1-6213, 
nucleic acids comprising nucleotide sequences which hybridize under stringent conditions to a 
fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 
consecutive nucleotides of one of SEQ ID NOs. 1-6213, nucleic acids comprising nucleotide
sequences which hybridize under stringent conditions to a nucleic acid complementary to one of 
SEQ ID NOs. 1-6213, nucleic acids comprising nucleotide sequences which hybridize under 
stringent conditions to a fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 
300, 400, or 500 consecutive nucleotides of the sequence complementary to one of SEQ ID NOs.
1-6213, nucleic acids comprising nucleotide sequences which hybridize under stringent conditions to
a nucleic acid selected from the group consisting of SEQ ID NOS.: 6214-42397, nucleic acids 
comprising nucleotide sequences which hybridize under stringent conditions to a fragment 
comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive 
nucleotides of one of SEQ ID NOS.: 6214-42397, nucleic acids comprising nucleotide sequences 
which hybridize under stringent conditions to a nucleic acid complementary to one of SEQ ID
NOS.: 6214-42397, nucleic acids comprising nucleotide sequences which hybridize under stringent 
conditions to a fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 
400, or 500 consecutive nucleotides of the sequence complementary to one of SEQ ID NOS.: 6214- 
42397. 
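
To make the fragment and complement language above concrete, the following Python sketch derives the
sequence complementary to a probe, enumerates its contiguous fragments of at least a given length,
and checks whether a target contains any such fragment. It is a minimal sketch under a loud
assumption: exact substring matching is used only as a crude stand-in for hybridization, which in
practice tolerates mismatches and depends on the stringency conditions. The function names and
example sequences are hypothetical.

```python
def reverse_complement(dna):
    """Return the reverse complement of a DNA string (A, C, G, T)."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def fragments(seq, min_len):
    """Yield every contiguous fragment of seq that is at least min_len long."""
    n = len(seq)
    for start in range(n):
        for end in range(start + min_len, n + 1):
            yield seq[start:end]


def shares_fragment(probe, target, min_len):
    """True if target contains a min_len window of probe or of its complement.

    Exact matching is an assumed simplification of hybridization. Checking
    windows of exactly min_len suffices, since any longer shared fragment
    contains such a window.
    """
    target = target.upper()
    for strand in (probe.upper(), reverse_complement(probe)):
        for start in range(len(strand) - min_len + 1):
            if strand[start:start + min_len] in target:
                return True
    return False


if __name__ == "__main__":
    print(reverse_complement("ATGC"))
    print(shares_fragment("ATGCGTACGT", "CCCATGCGTACCC", 6))
```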

The libraries may also be screened to identify homologous coding nucleic acids or
homologous antisense nucleic acids comprising nucleotide sequences which hybridize under
moderate conditions to a nucleic acid selected from the group consisting of SEQ ID NOs.: 1-6213, 
nucleic acids comprising nucleotide sequences which hybridize under moderate conditions to a 
fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500
consecutive nucleotides of one of SEQ ID NOs. 1-6213, nucleic acids comprising nucleotide
sequences which hybridize under moderate conditions to a nucleic acid complementary to one of 
SEQ ID NOs. 1-6213, nucleic acids comprising nucleotide sequences which hybridize under 
moderate conditions to a fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 
200, 300, 400, or 500 consecutive nucleotides of the sequence complementary to one of SEQ ID
NOs. 1-6213, nucleic acids comprising nucleotide sequences which hybridize under moderate
conditions to a nucleic acid selected from the group consisting of SEQ ID NOS.: 6214-42397, 
nucleic acids comprising nucleic acid sequences which hybridize under moderate conditions to a 
fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500
consecutive nucleotides of one of SEQ ID NOS.: 6214-42397, nucleic acids comprising nucleotide
sequences which hybridize under moderate conditions to a nucleic acid complementary to one of 
SEQ ID NOS.: 6214-42397 and nucleic acids comprising nucleotide sequences which hybridize 
under moderate conditions to a fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 
150, 200, 300, 400, or 500 consecutive nucleotides of the sequence complementary to one of SEQ
ID NOS.: 6214-42397. 

The homologous coding nucleic acids, homologous antisense nucleic acids or homologous 
polypeptides identified as above can then be used as targets or tools for the identification of new
antimicrobial compounds using methods such as those described herein. In some embodiments, the
homologous coding nucleic acids, homologous antisense nucleic acids, or homologous polypeptides
may be used to identify compounds with activity against more than one microorganism.

For example, the preceding methods may be used to isolate homologous coding nucleic 
acids or homologous antisense nucleic acids comprising a nucleotide sequence with at least 97%, at 
least 95%, at least 90%, at least 85%, at least 80%, or at least 70% nucleotide sequence identity to a
nucleotide sequence selected from the group consisting of one of the sequences of SEQ ID NOS.
1-6213, fragments comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500
consecutive nucleotides thereof, and the sequences complementary thereto. The preceding methods
may also be used to isolate homologous coding nucleic acids or homologous antisense nucleic acids 
comprising a nucleotide sequence with at least 97%, at least 95%, at least 90%, at least 85%, at
least 80%, or at least 70% nucleotide sequence identity to a nucleotide sequence selected from the
group consisting of one of the nucleotide sequences of SEQ ID NOS.: 6214-42397, fragments 
comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive 
nucleotides thereof, and the sequences complementary thereto. Identity may be measured using
BLASTN version 2.0 with the default parameters (Altschul, S.F. et al., Gapped BLAST and
PSI-BLAST: A New Generation of Protein Database Search Programs, Nucleic Acids Res. 25:
3389-3402 (1997)). For example, the homologous polynucleotides may comprise a coding sequence
which is a naturally occurring allelic variant of one of the coding sequences described herein. Such 
allelic variants may have a substitution, deletion or addition of one or more nucleotides when 
compared to the nucleic acids of SEQ ID NOs: 1-6213, SEQ ID NOS.: 6214-42397 or the
nucleotide sequences complementary thereto.
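
The identity thresholds recited above can be illustrated with a short Python sketch that computes
percent identity over a pairwise alignment and reports the highest recited threshold that is met. It
assumes the two sequences have already been aligned (for example by BLASTN) and are supplied as
equal-length strings with `-` marking gaps; identity is taken as matching columns divided by
alignment length, which is how BLAST reports it. The function names and example alignments are
hypothetical.

```python
# Thresholds recited in the text, highest first.
THRESHOLDS = (97, 95, 90, 85, 80, 70)


def percent_identity(aligned_a, aligned_b):
    """Percent identity over a gapped pairwise alignment ('-' marks a gap)."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("aligned sequences must be non-empty and equal in length")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def highest_threshold_met(identity):
    """Return the highest recited threshold satisfied, or None if below 70%."""
    for threshold in THRESHOLDS:
        if identity >= threshold:
            return threshold
    return None


if __name__ == "__main__":
    # One substitution in ten columns, as might occur in an allelic variant.
    pid = percent_identity("ACGTACGTAC", "ACGTACGTAA")
    print(pid, highest_threshold_met(pid))
```

A single-nucleotide substitution, deletion or addition in an allelic variant lowers the identity
only slightly for sequences of realistic length, so such variants would typically fall in the
highest bands.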

Additionally, the above procedures may be used to isolate homologous coding nucleic acids 
which encode polypeptides having at least 99%, at least 95%, at least 90%, at least 85%, at least 80%, at
least 70%, at least 60%, at least 50%, at least 40% or at least 25% amino acid identity or similarity
to a polypeptide comprising the sequence of one of SEQ ID NOs: 42398-78581 or to a polypeptide
whose expression is inhibited by a nucleic acid of one of SEQ ID NOs: 1-6213 or fragments
comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof
as determined using the FASTA version 3.0t78 algorithm with the default parameters.
Alternatively, protein identity or similarity may be identified using BLASTP with the default
parameters, BLASTX with the default parameters, or TBLASTN with the default parameters
(Altschul, S.F. et al., Gapped BLAST and PSI-BLAST: A New Generation of Protein Database
Search Programs, Nucleic Acids Res. 25: 3389-3402 (1997)).

Alternatively, homologous coding nucleic acids, homologous antisense nucleic acids or
homologous polypeptides may be identified by searching a database to identify sequences having a
desired level of nucleotide or amino acid sequence homology to a nucleic acid or polypeptide 
involved in proliferation or an antisense nucleic acid to a nucleic acid involved in microbial 
proliferation. A variety of such databases are available to those skilled in the art, including 
GenBank and GenSeq. In some embodiments, the databases are screened to identify nucleic acids
with at least 97%, at least 95%, at least 90%, at least 85%, at least 80%, or at least 70% nucleotide
sequence identity to a nucleic acid required for proliferation, an antisense nucleic acid which 
inhibits proliferation, or a portion of a nucleic acid required for proliferation or a portion of an 
antisense nucleic acid which inhibits proliferation. For example, homologous coding sequences 
may be identified by using a database to identify nucleic acids homologous to one of SEQ ID NOs.
1-6213, homologous to fragments comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150,
200, 300, 400, or 500 consecutive nucleotides thereof, nucleic acids homologous to one of SEQ ID
NOS.: 6214-42397, homologous to fragments comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75,
100, 150, 200, 300, 400, or 500 consecutive nucleotides of one of SEQ ID NOS.: 6214-42397,
nucleic acids homologous to one of SEQ ID NOs. 1-6213, homologous to fragments comprising at
least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides thereof
or nucleic acids homologous to the sequences complementary to any of the preceding nucleic acids.
In other embodiments, the databases are screened to identify polypeptides having at least 99%,
at least 95%, at least 90%, at least 85%, at least 80%, at least 70%, at least 60%, at least 50%, at least 40%
or at least 25% amino acid sequence identity or similarity to a polypeptide involved in proliferation
or a portion thereof. For example, the database may be screened to identify polypeptides
homologous to a polypeptide comprising one of SEQ ID NOs: 42398-78581, a polypeptide whose 
expression is inhibited by a nucleic acid of one of SEQ ID NOs: 1-6213 or homologous to 
fragments comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino 
acids of any of the preceding polypeptides. In some embodiments, the database may be screened to
identify homologous coding nucleic acids, homologous antisense nucleic acids or homologous
polypeptides from cells or microorganisms other than the Escherichia coli, Staphylococcus aureus,
Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium,
Acinetobacter baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia
burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter
jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium
botulinum, Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus 
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus 
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi, 
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio
cholerae or Yersinia pestis species from which they were obtained. For example, the database may
be screened to identify homologous coding nucleic acids, homologous antisense nucleic acids or 
homologous polypeptides from microorganisms such as Acinetobacter baumannii, Anaplasma 
marginale, Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis,
Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei,
Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata),
Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr
(also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia
trachomatis, Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium
perfringens, Coccidioides immitis, Corynebacterium diptheriae, Cryptococcus neoformans,
Enterobacter cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, 
Haemophilus influenzae, Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae, 
Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, 
Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia
asteroides, Pasteurella haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus 
mirabilis, Proteus vulgaris, Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas 
syringae, Salmonella bongori, Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, 
Salmonella typhi, Salmonella typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri,
Shigella sonnei, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus,
Streptococcus pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, 
Ureaplasma urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia 
enterocolitica, Yersinia pestis or any species falling within the genera of any of the above species, 
including coagulase negative Staphylococcus. In some embodiments, the homologous coding
nucleic acids, homologous antisense nucleic acids, or homologous polypeptides are from an
organism other than E. coli. 
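
A database screen of the kind described above might be post-processed as in the following Python
sketch, which filters search hits by percent identity and aligned length. It assumes, as an
illustration only, that the results were exported in the 12-column tab-delimited tabular format
produced by NCBI BLAST (`-outfmt 6` in the BLAST+ programs); the query and subject identifiers,
the sample values, and the function name are hypothetical.

```python
import csv
import io

# Column order of the standard 12-column BLAST tabular output.
FIELDS = [
    "qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
    "qstart", "qend", "sstart", "send", "evalue", "bitscore",
]

# Hypothetical example hits; not real search results.
SAMPLE = "\n".join("\t".join(row) for row in [
    ("query_1", "subject_A", "98.50", "200", "3", "0", "1", "200", "51", "250", "1e-100", "360"),
    ("query_1", "subject_B", "72.00", "150", "42", "0", "1", "150", "10", "159", "1e-20", "90"),
    ("query_2", "subject_C", "91.00", "40", "3", "1", "5", "44", "100", "139", "1e-8", "50"),
])


def filter_hits(tabular_text, min_identity=70.0, min_length=10):
    """Return (query, subject, identity, length) for hits meeting both cutoffs."""
    hits = []
    reader = csv.reader(io.StringIO(tabular_text), delimiter="\t")
    for row in reader:
        if not row or row[0].startswith("#"):
            continue  # skip blank and comment lines
        record = dict(zip(FIELDS, row))
        identity = float(record["pident"])
        length = int(record["length"])
        if identity >= min_identity and length >= min_length:
            hits.append((record["qseqid"], record["sseqid"], identity, length))
    return hits


if __name__ == "__main__":
    for hit in filter_hits(SAMPLE, min_identity=90.0, min_length=50):
        print(hit)
```

The default cutoffs mirror the lowest identity (70%) and shortest fragment length (10 nucleotides)
recited in the text; stricter values can be passed to narrow the result set.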

In another embodiment, gene expression arrays and microarrays can be employed. Gene 
expression arrays are high density arrays of DNA samples deposited at specific locations on a glass 
chip, nylon membrane, or the like. Such arrays can be used by researchers to quantify relative gene
expression under different conditions. Gene expression arrays are used by researchers to help
identify optimal drug targets, profile new compounds, and determine disease pathways. An 
example of this technology is found in U.S. Patent No. 5,807,522.
It is possible to study the expression of all genes in the genome of a particular microbial
organism using a single array. For example, the arrays may consist of 12 x 24 cm nylon filters 
containing PCR products corresponding to ORFs from Escherichia coli, Staphylococcus aureus, 
Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium,
Acinetobacter baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia
burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter 
jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium
botulinum, Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus 
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus 
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi, 
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus 
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio
cholerae or Yersinia pestis (including the nucleic acids of SEQ ID NOs.: 6214-42397). 10 ng of
each PCR product is spotted every 1.5 mm on the filter. Single stranded labeled cDNAs are
prepared for hybridization to the array (no second strand synthesis or amplification step is done)
and placed in contact with the filter. Thus the labeled cDNAs are of "antisense" orientation.
Quantitative analysis is done by phosphorimager.
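
As a back-of-the-envelope check on the array dimensions given above, the following Python sketch
estimates how many spots fit on a 12 x 24 cm filter at 1.5 mm spacing and how much PCR product that
would require at 10 ng per spot. It assumes one spot per 1.5 mm x 1.5 mm cell with no margins or
control positions, so the result is an upper bound rather than a figure stated in the text; the
function name is hypothetical.

```python
def grid_capacity(width_mm, height_mm, pitch_mm):
    """Return (rows, cols, total) for one spot per pitch x pitch cell, no margins."""
    cols = int(width_mm // pitch_mm)
    rows = int(height_mm // pitch_mm)
    return rows, cols, rows * cols


if __name__ == "__main__":
    rows, cols, spots = grid_capacity(120.0, 240.0, 1.5)  # 12 x 24 cm filter
    total_ng = spots * 10  # 10 ng of PCR product per spot
    print(rows, cols, spots, total_ng)  # 160 80 12800 128000
```

Under these assumptions a single filter holds up to 12,800 spots, which is more than enough
positions for every ORF of a typical bacterial genome.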

Hybridization of cDNA made from a sample of total cell mRNA to such an array followed 
by detection of binding by one or more of various techniques known to those in the art results in a 
signal at each location on the array to which cDNA hybridized. The intensity of the hybridization 
signal obtained at each location in the array thus reflects the amount of mRNA for that specific 
gene that was present in the sample. Comparing the results obtained for mRNA isolated from cells
grown under different conditions thus allows for a comparison of the relative amount of expression 
of each individual gene during growth under the different conditions. 
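
The comparison between growth conditions described above is commonly expressed as a log2 ratio of
signal intensities per gene. The following Python sketch shows one way to compute it; the gene names,
intensity values, and the intensity floor used to avoid taking the logarithm of zero are
illustrative assumptions, not values from the text.

```python
import math


def log2_ratios(condition_a, condition_b, floor=1.0):
    """Per-gene log2(B / A) for genes measured under both conditions.

    Intensities below `floor` are raised to `floor` (an assumed safeguard
    against division by zero and log of zero).
    """
    ratios = {}
    for gene in condition_a.keys() & condition_b.keys():
        a = max(condition_a[gene], floor)
        b = max(condition_b[gene], floor)
        ratios[gene] = math.log2(b / a)
    return ratios


if __name__ == "__main__":
    # Hypothetical phosphorimager intensities for two genes.
    untreated = {"gene_1": 100.0, "gene_2": 400.0}
    treated = {"gene_1": 400.0, "gene_2": 100.0}
    print(log2_ratios(untreated, treated))
```

A positive value indicates higher relative expression under the second condition and a negative
value indicates lower relative expression.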

Gene expression arrays may be used to analyze the total mRNA expression pattern at 
various time points after induction of an antisense nucleic acid complementary to a proliferation-
required gene. Analysis of the expression pattern indicated by hybridization to the array provides
information on other genes whose expression is influenced by antisense expression. For example, if 
the antisense is complementary to a gene for ribosomal protein L7/L12 in the 50S subunit, levels of 
other mRNAs may be observed to increase, decrease or stay the same following expression of 
antisense to the L7/L12 gene. If the antisense is complementary to a different 50S subunit 
ribosomal protein mRNA (e.g. L25), a different mRNA expression pattern may result. Thus, the
mRNA expression pattern observed following expression of an antisense nucleic acid comprising a
nucleotide sequence complementary to a proliferation-required gene may identify other
proliferation-required nucleic acids. In addition, the mRNA expression patterns observed when the
bacteria are exposed to candidate drug compounds or known antibiotics may be compared to those
observed with antisense nucleic acids comprising a nucleotide sequence complementary to a
proliferation-required nucleic acid. If the mRNA expression pattern observed with the candidate
drug compound is similar to that observed with the antisense nucleic acid, the drug compound may
be a promising therapeutic candidate. Thus, the assay would be useful in assisting in the selection
of promising candidate drug compounds for use in drug development. 
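
One simple way to quantify whether two expression patterns are "similar" in the sense used above is
a correlation coefficient computed across the genes on the array. The following Python sketch uses
Pearson correlation of per-gene log ratios; the choice of Pearson correlation, the profile values,
and the function names are assumptions made for illustration, since the text does not prescribe a
similarity measure.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant profile")
    return sxy / math.sqrt(sxx * syy)


def profile_similarity(antisense_profile, compound_profile):
    """Correlate two {gene: log_ratio} profiles over their shared genes."""
    genes = sorted(antisense_profile.keys() & compound_profile.keys())
    return pearson(
        [antisense_profile[g] for g in genes],
        [compound_profile[g] for g in genes],
    )


if __name__ == "__main__":
    # Hypothetical log2 expression ratios.
    antisense = {"gene_1": 2.0, "gene_2": -1.0, "gene_3": 0.5}
    compound = {"gene_1": 1.8, "gene_2": -0.7, "gene_3": 0.4}
    print(profile_similarity(antisense, compound))
```

A coefficient near 1 would suggest that the compound and the antisense nucleic acid perturb
expression in a similar way; any cutoff for calling a compound promising would have to be chosen
empirically.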

In cases where the source of nucleic acid deposited on the array and the source of the 
nucleic acid being hybridized to the array are from two different cells or microorganisms, gene 
expression arrays can identify homologous nucleic acids in the two cells or microorganisms.

The present invention also contemplates additional methods for screening other
microorganisms for proliferation-required genes. In one aspect of this embodiment, an antisense
nucleic acid comprising a nucleotide sequence complementary to the proliferation-required sequences 
from Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae, 
Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter baumannii, Bacillus anthracis, 
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae,
Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, 
Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus 
influenzae, Helicobacter pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella 
catarrhalis, Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium
tuberculosis, Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria 
meningitidis, Pasteurella multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas 
syringae, Salmonella paratyphi, Salmonella typhi, Staphylococcus epidermidis, Staphylococcus 
haemolyticus, Streptococcus mutans, Streptococcus pneumoniae, Streptococcus pyogenes, 
Treponema pallidum, Ureaplasma urealyticum, Vibrio cholerae or Yersinia pestis, or a portion
thereof, is transcribed in an antisense orientation in such a way as to alter the level or activity of a 
nucleic acid required for proliferation of an autologous or heterologous cell or microorganism. For 
example, the antisense nucleic acid may be a homologous antisense nucleic acid such as an antisense 
nucleic acid homologous to the nucleotide sequence complementary to one of SEQ ID NOs.: 6214-42397,
an antisense nucleic acid comprising a nucleotide sequence homologous to one of SEQ ID
NOs.: 1-6213, or an antisense nucleic acid comprising a nucleotide sequence complementary to a
portion of any of the preceding nucleic acids. The cell or microorganism transcribing the homologous 
antisense nucleic acid may be used in a cell-based assay, such as those described herein, to identify 
candidate antibiotic compounds. In another embodiment, the conserved portions of nucleotide 
sequences identified as proliferation-required can be used to generate degenerate primers for use in the
polymerase chain reaction (PCR). The PCR technique is well known in the art. The successful 
production of a PCR product using degenerate primers generated from the nucleotide sequences 
identified herein indicates the presence of a homologous gene sequence in the species being screened.
This homologous gene is then isolated, expressed, and used as a target for candidate antibiotic
compounds. In another aspect of this embodiment, the homologous gene (for example a homologous
coding nucleic acid) thus identified, or a portion thereof, is transcribed in an autologous cell or
microorganism or in a heterologous cell or microorganism in an antisense orientation in such a way as
to alter the level or activity of a homologous gene required for proliferation in the autologous or
heterologous cell or microorganism. Alternatively, a homologous antisense nucleic acid may be 
transcribed in an autologous or heterologous cell or microorganism in such a way as to alter the level or 
activity of a gene product required for proliferation in the autologous or heterologous cell or 
microorganism. 
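
As an illustration of how conserved portions of proliferation-required sequences might be turned into a degenerate primer, the following is a minimal sketch. It assumes the conserved region has already been aligned without gaps across species, and it collapses each alignment column into the standard IUPAC ambiguity code. The function names and the example sequences are hypothetical and are not taken from the Sequence Listing.

```python
# Standard IUPAC nucleotide ambiguity codes, keyed by the set of bases
# observed at one alignment position.
IUPAC = {
    frozenset("A"): "A", frozenset("C"): "C",
    frozenset("G"): "G", frozenset("T"): "T",
    frozenset("AG"): "R", frozenset("CT"): "Y",
    frozenset("CG"): "S", frozenset("AT"): "W",
    frozenset("GT"): "K", frozenset("AC"): "M",
    frozenset("CGT"): "B", frozenset("AGT"): "D",
    frozenset("ACT"): "H", frozenset("ACG"): "V",
    frozenset("ACGT"): "N",
}


def degenerate_primer(aligned):
    """Collapse equal-length, gap-free aligned sequences into one IUPAC primer."""
    if not aligned:
        raise ValueError("at least one sequence is required")
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be aligned to the same length")
    columns = zip(*(seq.upper() for seq in aligned))
    # A character other than A, C, G or T raises KeyError here.
    return "".join(IUPAC[frozenset(column)] for column in columns)


def degeneracy(primer):
    """Number of distinct oligonucleotides represented by the primer."""
    sizes = {code: len(bases) for bases, code in IUPAC.items()}
    total = 1
    for code in primer:
        total *= sizes[code]
    return total


# Hypothetical conserved region from three species (illustrative only).
primer = degenerate_primer(["ATGACA", "ATGGCT", "ATGGCA"])
```

A lower degeneracy generally means a smaller primer pool; practical primer design would also weigh length, melting temperature and codon usage, none of which are modeled here.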

The nucleic acids homologous to the genes required for the proliferation of Escherichia
coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas
aeruginosa, Salmonella typhimurium, Acinetobacter baumannii, Bacillus anthracis, Bacteroides
fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum,
Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis,
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Corynebacterium
diptheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus influenzae, Helicobacter 
pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium 
avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma 
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella
multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi,
Salmonella typhi, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
mutans, Streptococcus pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae or Yersinia pestis or the sequences complementary thereto may be 
used to identify homologous coding nucleic acids or homologous antisense nucleic acids from cells
or microorganisms other than Escherichia coli, Staphylococcus aureus, Enterococcus faecalis,
Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter
baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi,
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni,
Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium
botulinum, Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi,
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio 
cholerae or Yersinia pestis to inhibit the proliferation of cells or microorganisms other than
Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae,
Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter baumannii, Bacillus anthracis,
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae,
Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile,
Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus
influenzae, Helicobacter pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella
catarrhalis, Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium
tuberculosis, Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria
meningitidis, Pasteurella multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas
syringae, Salmonella paratyphi, Salmonella typhi, Staphylococcus epidermidis, Staphylococcus 
haemolyticus, Streptococcus mutans, Streptococcus pneumoniae, Streptococcus pyogenes, 
Treponema pallidum, Ureaplasma urealyticum, Vibrio cholerae or Yersinia pestis by inhibiting the 
activity or reducing the amount of the identified homologous coding nucleic acid or homologous
polypeptide in the cell or microorganism other than Escherichia coli, Staphylococcus aureus,
Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium,
Acinetobacter baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia
burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter
jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium
botulinum, Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi,
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus 
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio 
cholerae or Yersinia pestis or to identify compounds which inhibit the growth of cells or 
microorganisms other than Escherichia coli, Staphylococcus aureus, Enterococcus faecalis,
Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter
baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, 
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, 
Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium 
botulinum, Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi,
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus 
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio 
cholerae or Yersinia pestis as described below. For example, the nucleic acids homologous to
proliferation-required genes from Escherichia coli, Staphylococcus aureus, Enterococcus faecalis,
Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter
baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi,
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni,
Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium
botulinum, Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi,
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus 
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio 
cholerae or Yersinia pestis or the sequences complementary thereto may be used to identify 
compounds which inhibit the growth of Acinetobacter baumannii, Anaplasma marginale,
Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia
burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter
jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis,
Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida
pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis,
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium perfringens,
Coccidioides immitis, Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter
cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae,
Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila,
Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis or any species falling within the genera of any of the above species. In some
embodiments of the present invention, the nucleic acids homologous to proliferation-required 
sequences from Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella 
pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter baumannii, 
Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia
cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Chlamydia 
pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum, 
Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus faecium,
Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria monocytogenes,
Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae,
Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria 
gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus mirabilis, Pseudomonas 
putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi, Staphylococcus 
epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus pneumoniae, 
Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio cholerae or
Yersinia pestis (including nucleic acids homologous to one of SEQ ID NOs.: 6214-42397) or the
sequences complementary thereto (including nucleic acids homologous to one of SEQ ID NOs.: 1-6213)
are used to identify proliferation-required sequences in an organism other than E. coli.

In another embodiment of the present invention, antisense nucleic acids complementary to the 
sequences identified as required for proliferation or portions thereof (including antisense nucleic acids
comprising a nucleotide sequence complementary to one of SEQ ID NOs.: 6214-42397 or portions 
thereof, such as the nucleic acids of SEQ ID NOs.: 1-6213) are transferred to vectors capable of 
function within a species other than the species from which the sequences were obtained. For 
example, the vector may be functional in Acinetobacter baumannii, Anaplasma marginale, 
Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia
burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter
jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis,
Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida
pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis,
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium perfringens,
Coccidioides immitis, Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter
cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae,
Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila,
Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis or any species falling within the genera of any of the above species. In some 
embodiments of the present invention, the vector may be functional in an organism other than E. 
coli. As would be appreciated by one of ordinary skill in the art, vectors may contain certain 
elements that are species specific. These elements can include promoter sequences, operator
sequences, repressor genes, origins of replication, ribosomal binding sequences, termination
sequences, and others. To use the antisense nucleic acids, one of ordinary skill in the art would 
know to use standard molecular biology techniques to isolate vectors containing the sequences of 
interest from cultured bacterial cells, isolate and purify those sequences, and subclone those 
sequences into a vector adapted for use in the species of bacteria to be screened. 

Vectors for a variety of other species are known in the art. For example, numerous vectors
which function in E. coli are known in the art. Also, Pla et al. have reported an expression vector
that is functional in a number of relevant hosts including: Salmonella typhimurium, Pseudomonas
putida, and Pseudomonas aeruginosa. J. Bacteriol. 172(8):4448-55 (1990). Brunschwig and
Darzins (Gene (1992) 111:35-4) described a shuttle expression vector for Pseudomonas aeruginosa.
Vectors useful for the production of stabilized mRNA having an increased lifetime (including
antisense RNA) in Gram negative organisms are described in U.S. Provisional Patent Application
Serial Number 60/343,512, filed December 21, 2001. Similarly many examples exist of expression
vectors that are freely transferable among various Gram positive microorganisms. Expression 
vectors for Enterococcus faecalis may be engineered by incorporating suitable promoters into a
pAK80 backbone (Israelsen, H., S. M. Madsen, A. Vrang, E. B. Hansen and E. Johansen. 1995.
Appl. Environ. Microbiol. 61:2540-2547). A number of vectors useful for nucleic acid expression
(including antisense nucleic acid expression) in Enterococcus faecalis, Staphylococcus aureus as
well as other Gram positive organisms are described in U.S. Patent Application Serial Number 
10/032,393, filed December 21, 2001. 

Following the subcloning of the antisense nucleic acids complementary to proliferation-required
sequences from Escherichia coli, Staphylococcus aureus, Enterococcus faecalis,
Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter
baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi,
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni,
Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium
botulinum, Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus 
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi, 
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio
cholerae or Yersinia pestis or portions thereof into a vector functional in a second cell or 
microorganism of interest (i.e. a cell or microorganism other than the one from which the identified 
nucleic acids were obtained), the antisense nucleic acids are conditionally transcribed to test for 
bacterial growth inhibition. The nucleotide sequences of the nucleic acids from Escherichia coli,
Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa,
Salmonella typhimurium, Acinetobacter baumannii, Bacillus anthracis, Bacteroides fragilis, 
Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, 
Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, 
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Corynebacterium
diptheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus influenzae, Helicobacter
pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium
avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma 
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella 
multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi,
Salmonella typhi, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
mutans, Streptococcus pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae or Yersinia pestis that, when transcribed, inhibit growth of the second 
cell or microorganism are compared to the known genomic sequence of the second cell or 
microorganism to identify the homologous gene from the second organism. If the homologous
sequence from the second cell or microorganism is not known, it may be identified and isolated by
hybridization to the proliferation-required Escherichia coli, Staphylococcus aureus, Enterococcus 
faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter 
baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, 
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni,
Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium
botulinum, Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus 
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi, 
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus 
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio
cholerae or Yersinia pestis sequence of interest or by amplification using PCR primers based on the
proliferation-required nucleotide sequence of interest as described above. In this way, sequences 
which may be required for the proliferation of the second cell or microorganism may be identified. 
For example, the second microorganism may be Acinetobacter baumannii, Anaplasma marginale,
Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia
burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter
jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis,
Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida
pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis,
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium perfringens,
Coccidioides immitis, Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter
cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae,
Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila,
Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella 
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis or any species falling within the genera of any of the above species. In some 
embodiments of the present invention, the second microorganism is an organism other than E. coli.
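
The step of comparing a growth-inhibiting sequence to the known genomic sequence of the second organism can be sketched as follows. This is a toy, ungapped comparison that scores percent identity at every offset on both strands. It stands in for a real homology search, which in practice would be run with an established alignment tool. The function names and sequences are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def best_ungapped_hit(query, genome):
    """Slide the query along both genome strands without gaps.

    Returns (identity, start, strand) for the best-scoring window, where
    identity is the fraction of matching positions and start is the
    0-based offset on the strand that was scanned.
    """
    query = query.upper()
    genome = genome.upper()
    if not query or len(query) > len(genome):
        raise ValueError("query must be non-empty and no longer than the genome")
    best = (0.0, -1, "+")
    for strand, target in (("+", genome), ("-", reverse_complement(genome))):
        for start in range(len(target) - len(query) + 1):
            window = target[start:start + len(query)]
            matches = sum(a == b for a, b in zip(query, window))
            identity = matches / len(query)
            if identity > best[0]:
                best = (identity, start, strand)
    return best


# Hypothetical sequences (illustrative only).
genome = "TTTTACGTACGGATCCAAAA"
hit_plus = best_ungapped_hit("ACGTACGGATCC", genome)
hit_minus = best_ungapped_hit(reverse_complement("ACGTACGGATCC"), genome)
```

Scanning both strands matters here because an antisense fragment matches the strand opposite to the coding sequence. The quadratic scan is only workable for short illustrative inputs.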

The homologous nucleic acid sequences from the second cell or microorganism which are 
identified as described above may then be operably linked to a promoter, such as an inducible 
promoter, in an antisense orientation and introduced into the second cell or microorganism. The 
techniques described herein for identifying Escherichia coli, Staphylococcus aureus, Enterococcus 
faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter
baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi,
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni,
Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium
botulinum, Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi,
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio
cholerae or Yersinia pestis genes required for proliferation may thus be employed to determine
whether the identified nucleotide sequences from a second cell or microorganism inhibit the
proliferation of the second cell or microorganism. For example, the second microorganism may be 
Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, 
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida 
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis, 
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica,
Yersinia pestis or any species falling within the genera of any of the above species. In some 
embodiments of the present invention, the second microorganism may be an organism other than E. 
coli. 
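
Whether conditional transcription of an antisense construct inhibits proliferation can be summarized numerically. The following minimal sketch assumes growth is read as optical density for the same clone with and without inducer. The 50 percent cut-off, the clone names and the readings are hypothetical and are not taken from the specification.

```python
def percent_inhibition(od_uninduced, od_induced):
    """Percent reduction in growth on induction, from two optical densities."""
    if od_uninduced <= 0:
        raise ValueError("uninduced optical density must be positive")
    return 100.0 * (1.0 - od_induced / od_uninduced)


def flag_inhibitory(readings, threshold=50.0):
    """Return the clones whose induced growth drops by at least `threshold` percent.

    `readings` maps a clone name to (uninduced OD, induced OD). The default
    threshold is an arbitrary illustrative value.
    """
    return [
        name
        for name, (uninduced, induced) in readings.items()
        if percent_inhibition(uninduced, induced) >= threshold
    ]


# Hypothetical readings (illustrative only).
readings = {
    "clone_1": (1.20, 0.30),  # strong growth defect on induction
    "clone_2": (1.10, 1.05),  # essentially unaffected
}
inhibitory = flag_inhibitory(readings)
```

A control clone carrying the empty vector would normally be measured alongside, so that effects of the inducer itself are not mistaken for antisense inhibition.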

Antisense nucleic acids required for the proliferation of microorganisms other than
Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae,
Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter baumannii, Bacillus anthracis,
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae,
Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile,
Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus
influenzae, Helicobacter pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella
catarrhalis, Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium
tuberculosis, Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria
meningitidis, Pasteurella multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas 
syringae, Salmonella paratyphi, Salmonella typhi, Staphylococcus epidermidis, Staphylococcus 
haemolyiicus, Streptococcus mutans, Streptococcus pneumoniae, Streptococcus pyogenes, 

5 Treponema pallidum, Ureaplasma urealyticum, Vibrio cholerae or Yersinia pestis or the genes 
corresponding thereto, may also be hybridized to a microarray containing the Escherichia coli, 
Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, 
Salmonella typhimurium, Acinetobacter baumarmii, Bacillus anthracis, Bacteroides fragilis, 
Bordetella pertussis, Borrelia burgdorferi, Burldiolderia cepacia, Burklwlderia fungorum, 

10 Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, 
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Corynebacterium 
diptheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus influenzae, Helicobacter 
pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium
avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella
multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, 
Salmonella typhi, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
mutans, Streptococcus pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae or Yersinia pestis (including the nucleic acids of SEQ ID NOs.: 6214-
42397) to gauge the homology between the Escherichia coli, Staphylococcus aureus, Enterococcus
faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter
baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, 
Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni,
Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium
botulinum, Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi,
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus 
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio 
cholerae or Yersinia pestis sequences and the proliferation-required nucleic acids from other cells 
or microorganisms. For example, the proliferation-required nucleic acid may be from
Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis,
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis, 
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis or any species falling within the genera of any of the above species. In some 
embodiments of the present invention, the proliferation-required nucleotide sequences from
Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae,
Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter baumannii, Bacillus anthracis,
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae, 
Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile,
Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus
influenzae, Helicobacter pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella
catarrhalis, Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium
tuberculosis, Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria 
meningitidis, Pasteurella multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas 
syringae, Salmonella paratyphi, Salmonella typhi, Staphylococcus epidermidis, Staphylococcus 
haemolyticus, Streptococcus mutans, Streptococcus pneumoniae, Streptococcus pyogenes,
Treponema pallidum, Ureaplasma urealyticum, Vibrio cholerae or Yersinia pestis or homologous 
nucleic acids are used to identify proliferation-required sequences in an organism other than E. coli. 
In some embodiments of the present invention, the proliferation-required sequences may be from an 
organism other than E. coli. The proliferation-required nucleic acids from a cell or microorganism 
other than Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae,
Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter baumannii, Bacillus anthracis, 
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae,
Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile,
Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus
influenzae, Helicobacter pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella
catarrhalis, Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium
tuberculosis, Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria
meningitidis, Pasteurella multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas 
syringae, Salmonella paratyphi, Salmonella typhi, Staphylococcus epidermidis, Staphylococcus
haemolyticus, Streptococcus mutans, Streptococcus pneumoniae, Streptococcus pyogenes, 
Treponema pallidum, Ureaplasma urealyticum, Vibrio cholerae or Yersinia pestis may be 
hybridized to the array under a variety of conditions which permit hybridization to occur when the
probe has different levels of homology to the nucleotide sequence on the microarray. This would 
provide an indication of homology across the cells or microorganisms as well as clues to other 
possible essential genes in these cells or microorganisms. 

In some embodiments of the present invention, the essential gene products described herein 
are used in methods of identifying a target on which a compound that inhibits cellular proliferation
acts. Such methods are described in the U.S. Patent Application entitled METHODS FOR 
IDENTIFYING THE TARGET OF A COMPOUND WHICH INHIBITS CELLULAR 
PROLIFERATION, filed February 8, 2002. As employed herein, some embodiments of methods 
used to identify a target on which a compound that inhibits cellular proliferation acts utilize 
collections or cultures of strains comprising strains which either overexpress a different gene
product which is required for cellular proliferation (such as the gene products described herein) or
underexpress a different gene product (such as the gene products described herein) which is
required for cellular proliferation (i.e. at least some of the strains in the culture overexpress or 
underexpress a gene product required for cellular proliferation). In some embodiments, the present 
invention uses collections or cultures of strains comprising both strains which overexpress gene
products required for cellular proliferation and strains which underexpress the same gene products 
required for cellular proliferation. Preferably, each of the strains present in the culture or collection 
either overexpresses or underexpresses a different gene product which is required for cellular 
proliferation (i.e. all of the strains in the culture overexpress or underexpress a gene product 
required for cellular proliferation). However, in some embodiments, the culture or collection may
include one or more strains which do not overexpress or underexpress a gene product which is 
required for proliferation. The gene product which is overexpressed or underexpressed in each 
strain may be any gene product which is required for cellular proliferation, including a gene 
product whose activity or level is inhibited by a nucleic acid comprising a nucleotide sequence 
selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 6214- 
42397, a gene product comprising an amino acid sequence selected from the group consisting of 
SEQ ID NOs.: 42398-78581, a gene product whose activity or level is inhibited by a homologous
antisense nucleic acid, a gene product encoded by a homologous coding nucleic acid, and a gene
product comprising a homologous polypeptide. 

As used herein the term "culture" refers to a plurality of strains growing in a single aliquot 
of a liquid growth medium and the term "collection" refers to a plurality of strains each of which is
growing in a separate aliquot of liquid growth medium or a different location on a solid growth
medium. 
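
By way of illustration only, the distinction drawn above between a "culture" and a "collection" can be sketched as a small data model. The names Strain, Vessel and classify, and the "mixed" label for arrangements that fit neither definition, are hypothetical and are not part of the disclosure; the sketch also does not distinguish liquid from solid medium.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Strain:
    name: str
    # Hypothetical bookkeeping: gene product identifier -> "over" or "under".
    # An empty mapping models a strain with unaltered expression.
    altered: Dict[str, str] = field(default_factory=dict)


@dataclass
class Vessel:
    # One aliquot of liquid medium, or one location on a solid medium.
    label: str
    strains: List[Strain] = field(default_factory=list)


def classify(vessels: List[Vessel]) -> str:
    """Label an arrangement of strains using the two definitions in the text."""
    populated = [v for v in vessels if v.strains]
    # "Culture": a plurality of strains growing together in a single aliquot.
    if len(populated) == 1 and len(populated[0].strains) > 1:
        return "culture"
    # "Collection": a plurality of strains, each in its own aliquot or location.
    if len(populated) > 1 and all(len(v.strains) == 1 for v in populated):
        return "collection"
    return "mixed"
```

For example, two strains sharing one tube would be classified as a culture, while the same two strains in separate wells would be classified as a collection.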

In some embodiments, if desired, one or more of the strains in the culture or collection of 
strains may overexpress or underexpress more than one gene product described herein which is 
required for cellular proliferation. In this embodiment, the gene products which are overexpressed
or underexpressed in one or more of the strains may be functionally related or functionally
unrelated. This may facilitate the identification of compounds when two or more gene products 
share similar functions in the cell or where the cell has multiple biochemical pathways which lead 
to a particular end product. 

Alternatively, if the gene product described herein to be overexpressed or underexpressed is
encoded by a gene which is part of an operon containing a plurality of genes, the desired gene may
be overexpressed or underexpressed while the remaining genes in the operon are expressed at levels 
where they do not impact the ability of the cell to grow in the presence of a particular compound. 
For example, the desired gene may be placed under the control of a regulatable promoter, a 
transcriptional terminator may be placed 3' of the desired gene and a promoter, preferably a
constitutive promoter, may be placed 3' of the transcriptional terminator and 5' of the remaining
genes in the operon. 
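
The 5' to 3' order of elements in the operon arrangement just described can be illustrated with a short sketch. This is a simplified model, not a cloning protocol: the function name build_construct and the tuple labels are hypothetical, and the sketch assumes that all remaining genes of the operon are placed downstream of the inserted promoter in their original relative order.

```python
from typing import List, Tuple

Element = Tuple[str, str]  # (kind of element, descriptive name)


def build_construct(operon_genes: List[str], desired_gene: str) -> List[Element]:
    """Return the 5'->3' element order for the arrangement described in the text."""
    if desired_gene not in operon_genes:
        raise ValueError(f"{desired_gene!r} is not in the operon")
    remaining = [g for g in operon_genes if g != desired_gene]
    construct: List[Element] = [
        ("promoter", "regulatable"),        # controls the desired gene
        ("gene", desired_gene),
        ("terminator", "transcriptional"),  # placed 3' of the desired gene
    ]
    if remaining:
        # Promoter (preferably constitutive) 3' of the terminator and
        # 5' of the remaining genes in the operon.
        construct.append(("promoter", "constitutive"))
        construct.extend(("gene", g) for g in remaining)
    return construct
```

The terminator insulates the remaining genes from the regulatable promoter, so that only expression of the desired gene changes when that promoter is induced or repressed.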

In some embodiments, the culture or collection of strains may comprise a strain which 
overexpresses or underexpresses a gene product whose activity or level is inhibited by a nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213.
In some embodiments, the culture or collection of strains may comprise strains which in aggregate
overexpress or underexpress at least two gene products whose activity or level is inhibited by a 
nucleic acid selected from the group consisting of SEQ ID NOS.: 1-6213, at least 10 gene products 
whose activity or level is inhibited by a nucleic acid selected from the group consisting of SEQ ID 
NOS.: 1-6213, at least 20 gene products whose activity or level is inhibited by a nucleic acid
selected from the group consisting of SEQ ID NOS.: 1-6213, at least 30 gene products whose
activity or level is inhibited by a nucleic acid selected from the group consisting of SEQ ID NOS.: 
1-6213, at least 50 gene products whose activity or level is inhibited by a nucleic acid selected from 
the group consisting of SEQ ID NOS.: 1-6213, at least 100 gene products whose activity or level is
inhibited by a nucleic acid selected from the group consisting of SEQ ID NOS.: 1-6213, at least 300
gene products whose activity or level is inhibited by a nucleic acid selected from the group
consisting of SEQ ID NOS.: 1-6213 or more than 300 gene products whose activity or level is
inhibited by a nucleic acid selected from the group consisting of SEQ ID NOS.: 1-6213, wherein 
each strain in the culture or collection of strains overexpresses or underexpresses a single gene
product whose activity or level is inhibited by a nucleic acid selected from the group consisting of
SEQ ID NOs. 1-6213. Alternatively, if desired, one or more of the strains in the culture or 
collection of strains may overexpress or underexpress more than one gene product whose activity or 
level is inhibited by a nucleic acid selected from the group consisting of SEQ ID NOs. 1-6213.

In other embodiments, the culture or collection of strains may comprise a strain which
overexpresses or underexpresses a gene product encoded by a nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs.: 6214-42397. In some embodiments, 
the culture or collection of strains may comprise strains which in aggregate overexpress or 
underexpress at least two gene products encoded by a nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs.: 6214-42397, at least 10 gene
products encoded by a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 6214-42397, at least 20 gene products encoded by a nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 6214-42397, 
at least 30 gene products encoded by a nucleic acid comprising a nucleotide sequence selected from 
the group consisting of SEQ ID NOs.: 6214-42397, at least 50 gene products encoded by a nucleic
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 6214- 
42397, at least 100 gene products encoded by a nucleic acid comprising a nucleotide sequence 
selected from the group consisting of SEQ ID NOs.: 6214-42397, at least 300 gene products 
encoded by a nucleic acid comprising a nucleotide sequence selected from the group consisting of 
SEQ ID NOs.: 6214-42397 or more than 300 gene products encoded by a nucleic acid comprising a
nucleotide sequence selected from the group consisting of SEQ ID NOs.: 6214-42397, wherein 
each strain in the culture or collection of strains overexpresses or underexpresses a single gene 
product encoded by a nucleic acid selected from the group consisting of SEQ ID NOs. 6214-42397. 
Alternatively, if desired, one or more strains in the culture or collection of strains may overexpress 
or underexpress more than one gene product encoded by a nucleic acid selected from the group
consisting of SEQ ID NOs. 6214-42397. 
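
The "in aggregate" thresholds recited above (at least two, 10, 20, 30, 50, 100 or 300 gene products) can be read as a count of distinct gene products altered across all strains. The following is a minimal sketch of that reading; the function names and the dictionary layout (strain name mapped to its altered gene products) are assumptions made for illustration.

```python
from typing import Dict, Set

# Hypothetical layout: strain name -> {gene product identifier: "over" or "under"}
Strains = Dict[str, Dict[str, str]]


def aggregate_products(strains: Strains) -> Set[str]:
    """Distinct gene products over- or underexpressed anywhere in the set of strains."""
    products: Set[str] = set()
    for altered in strains.values():
        products.update(altered)
    return products


def meets_threshold(strains: Strains, minimum: int,
                    single_product_per_strain: bool = True) -> bool:
    """Check an 'at least N gene products in aggregate' condition.

    With single_product_per_strain=True, every strain must alter exactly one
    gene product, mirroring the 'wherein each strain ... a single gene
    product' clause; set it to False for the alternative embodiments.
    """
    if single_product_per_strain and any(len(a) != 1 for a in strains.values()):
        return False
    return len(aggregate_products(strains)) >= minimum
```

Note that two strains altering the same gene product (one overexpressing, one underexpressing) contribute only one gene product to the aggregate count under this reading.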

In some embodiments the culture or collection of strains comprises a strain in which a gene 
product comprising an amino acid sequence selected from the group consisting of SEQ ID NOs.: 
42938-78581 is overexpressed or underexpressed. In some embodiments, the culture or collection 
of strains may comprise strains which in aggregate overexpress or underexpress at least two gene
products comprising an amino acid sequence selected from the group consisting of SEQ ID NOs.: 
42938-78581, at least 10 gene products comprising an amino acid sequence selected from the group 
consisting of SEQ ID NOs.: 42938-78581, at least 20 gene products comprising an amino acid 
sequence selected from the group consisting of SEQ ID NOs.: 42938-78581, at least 30 gene 
products comprising an amino acid sequence selected from the group consisting of SEQ ID NOs.:
42938-78581, at least 50 gene products comprising an amino acid sequence selected from the group 
consisting of SEQ ID NOs.: 42938-78581, at least 100 gene products comprising an amino acid 
sequence selected from the group consisting of SEQ ID NOs.: 42938-78581, at least 300 gene
products comprising an amino acid sequence selected from the group consisting of SEQ ID NOs.:
42938-78581 or more than 300 gene products comprising an amino acid sequence selected from the 
group consisting of SEQ ID NOs.: 42938-78581, wherein each strain in the culture or collection of 
strains overexpresses or underexpresses a single gene product selected from the group consisting of 
SEQ ID NOs. 42938-78581. Alternatively, if desired, one or more of the strains in the culture or
collection of strains may overexpress or underexpress more than one gene product selected from the
group consisting of SEQ ID NOs. 42938-78581.

In other embodiments, the culture or collection of strains comprises a strain in which at 
least one of the gene products encoded by a homologous coding nucleic acid as defined above is 
overexpressed or underexpressed. In some embodiments, the culture or collection of strains may
comprise strains which in aggregate overexpress or underexpress at least 2, at least 10, at least 20, 
at least 30, at least 50, at least 100, at least 300 or more than 300 gene products encoded by a 
homologous coding nucleic acid as defined above. If desired the culture or collection of strains 
may comprise one or more strains which overexpress or underexpress more than one gene product 
encoded by a homologous coding nucleic acid. In further embodiments, the culture or collection of
strains comprises a strain in which at least one, at least 10, at least 20, at least 30, at least 50, at least 
100, at least 300 or more than 300 homologous polypeptides as defined above is overexpressed or 
underexpressed. If desired the culture or collection of strains may comprise one or more strains 
which overexpress or underexpress more than one homologous polypeptide.

For example, in some embodiments, the culture or collection of strains comprises a strain in
which at least one gene product selected from the group consisting of a gene product having at least 
70% nucleotide sequence identity as determined using BLASTN version 2.0 with the default 
parameters to a gene product whose expression is inhibited by an antisense nucleic acid comprising 
a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product 
encoded by a nucleic acid having at least 70% nucleotide sequence identity as determined using
BLASTN version 2.0 with the default parameters to a nucleic acid encoding a gene product whose 
expression is inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from 
the group consisting of SEQ ID NOs: 1-6213, a gene product having at least 25% amino acid 
identity as determined using FASTA version 3.0t78 with the default parameters to a gene product 
whose expression is inhibited by an antisense nucleic acid comprising a nucleotide sequence
selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic 
acid which hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by a nucleic 
acid which hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product whose activity
may be complemented by the gene product whose activity is inhibited by a nucleic acid comprising 
a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1-6213 is overexpressed 
or underexpressed, wherein each strain overexpresses or underexpresses one gene product. In some
embodiments, the culture or collection of strains may comprise strains in which in aggregate at least
2, at least 10, at least 20, at least 30, at least 50, at least 100, at least 300, or more than 300 gene 
products selected from the group consisting of a gene product having at least 70% nucleotide 
sequence identity as determined using BLASTN version 2.0 with the default parameters to a gene 
product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a 
nucleic acid having at least 70% nucleotide sequence identity as determined using BLASTN 
version 2.0 with the default parameters to a nucleic acid encoding a gene product whose expression 
is inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs: 1-6213, a gene product having at least 25% amino acid identity as
determined using FASTA version 3.0t78 with the default parameters to a gene product whose 
expression is inhibited by an antisense nucleic acid comprising a nucleotide sequence selected from 
the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid which 
hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group consisting of 
SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by a nucleic acid which
hybridizes to a nucleic acid comprising a nucleotide sequence selected from the group consisting of 
SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product whose activity may be 
complemented by the gene product whose activity is inhibited by a nucleic acid comprising a 
nucleotide sequence selected from the group consisting of SEQ ID NOs: 1-6213 is overexpressed 
or underexpressed, wherein each strain overexpresses or underexpresses one gene product.
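
The identity cutoffs recited above (at least 70% nucleotide sequence identity, at least 25% amino acid identity) can be illustrated with a simple calculation over two sequences that have already been aligned. This sketch is not BLASTN or FASTA and will not reproduce their scores or alignments; the function names, the gap character and the way gapped columns are counted are assumptions chosen for illustration.

```python
# Cutoffs taken from the text; the dictionary keys are hypothetical labels.
MIN_IDENTITY = {"nucleotide": 70.0, "amino_acid": 25.0}


def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns in which both sequences carry the same residue.

    Both inputs must already be aligned to equal length. Columns that are
    gaps in both sequences are ignored; a gap opposite a residue counts as
    a mismatch (an assumption, as tools differ on this point).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == gap and y == gap)]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def passes_cutoff(identity: float, kind: str) -> bool:
    """True if the identity meets the cutoff recited for this sequence type."""
    return identity >= MIN_IDENTITY[kind]
```

In practice the identity value would be taken from the report of the alignment program named in the text, and only the comparison against the cutoff would be applied.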

If desired, one or more of the strains in the culture or collection of strains may overexpress 
or underexpress more than one gene product selected from the group consisting of a gene product 
having at least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the 
default parameters to a gene product whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a
gene product encoded by a nucleic acid having at least 70% nucleotide sequence identity as 
determined using BLASTN version 2.0 with the default parameters to a nucleic acid encoding a 
gene product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs: 1-6213, a gene product having at least 
25% amino acid identity as determined using FASTA version 3.0t78 with the default parameters to
a gene product whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a 
nucleic acid which hybridizes to a nucleic acid comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NOs.: 1-6213 under stringent conditions, a gene product encoded by a 
nucleic acid which hybridizes to a nucleic acid comprising a nucleotide sequence selected from the
group consisting of SEQ ID NOs.: 1-6213 under moderate conditions, and a gene product whose
activity may be complemented by the gene product whose activity is inhibited by a nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1-6213.

In further embodiments, the culture or collection of strains comprises a strain in which at
least one gene product encoded by a nucleic acid comprising a nucleotide sequence selected from 
the group consisting of a nucleic acid comprising a nucleic acid having at least 70% nucleotide 
sequence identity as determined using BLASTN version 2.0 with the default parameters to a 
nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214-42397, a nucleic
acid comprising a nucleotide sequence which hybridizes to a sequence selected from the group 
consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic acid comprising 
a nucleotide sequence which hybridizes to a nucleotide sequence selected from the group consisting 
of SEQ ID NOS.: 6214-42397 under moderate conditions is overexpressed or underexpressed, 
wherein each strain overexpresses or underexpresses one gene product. In some embodiments, the
culture or collection of strains comprises a strain or a group of strains in which in aggregate at least 
2, at least 10, at least 20, at least 30, at least 50, at least 100, at least 300, or more than 300 gene 
products encoded by a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of a nucleic acid comprising a nucleic acid having at least 70% nucleotide sequence 
identity as determined using BLASTN version 2.0 with the default parameters to a nucleotide
sequence selected from the group consisting of SEQ ID NOS.: 6214-42397, a nucleic acid 
comprising a nucleotide sequence which hybridizes to a sequence selected from the group 
consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic acid comprising 
a nucleotide sequence which hybridizes to a nucleotide sequence selected from the group consisting 
of SEQ ID NOS.: 6214-42397 under moderate conditions is overexpressed or underexpressed,
wherein each strain overexpresses or underexpresses one gene product. 

If desired, one or more of the strains in the culture or collection of strains may overexpress 
or underexpress more than one gene product encoded by a nucleic acid comprising a nucleotide 
sequence selected from the group consisting of a nucleic acid comprising a nucleic acid having at 
least 70% nucleotide sequence identity as determined using BLASTN version 2.0 with the default
parameters to a nucleotide sequence selected from the group consisting of SEQ ID NOS.: 6214- 
42397, a nucleic acid comprising a nucleotide sequence which hybridizes to a sequence selected 
from the group consisting of SEQ ID NOS.: 6214-42397 under stringent conditions, and a nucleic 
acid comprising a nucleotide sequence which hybridizes to a nucleotide sequence selected from the 
group consisting of SEQ ID NOS.: 6214-42397 under moderate conditions.

In additional embodiments, the culture or collection of strains comprises a strain in which 
at least one gene product comprising a polypeptide selected from the group consisting of a 
polypeptide having at least 25% amino acid identity as determined using FASTA version 3.0t78 to 
a polypeptide selected from the group consisting of SEQ ID NOs.: 42938-78581 and a polypeptide 
whose activity may be complemented by a polypeptide selected from the group consisting of SEQ
ID NOs: 42938-78581 is overexpressed or underexpressed, wherein each strain overexpresses or 
underexpresses one gene product. In some embodiments, the culture or collection of strains 
comprises a strain or a group of strains in which in aggregate at least 2, at least 10, at least 20, at
least 30, at least 50, at least 100, at least 300, or more than 300 gene products comprising a 
polypeptide selected from the group consisting of a polypeptide having at least 25% amino acid 
identity as determined using FASTA version 3.0t78 to a polypeptide selected from the group 
consisting of SEQ ID NOs.: 42938-78581 and a polypeptide whose activity may be complemented
by a polypeptide selected from the group consisting of SEQ ID NOs: 42938-78581 is overexpressed
or underexpressed, wherein each strain overexpresses or underexpresses one gene product. 

If desired, one or more of the strains in the culture or collection of strains may overexpress 
or underexpress more than one polypeptide selected from the group consisting of a polypeptide 
having at least 25% amino acid identity as determined using FASTA version 3.0t78 to a
polypeptide selected from the group consisting of SEQ ID NOs.: 42938-78581 and a polypeptide
whose activity may be complemented by a polypeptide selected from the group consisting of SEQ 
ID NOs: 42938-78581. 

The methods of the present invention may be used to identify the targets of compounds 
which inhibit the proliferation of any desired cell or organism. In some embodiments, these
methods are employed to identify the targets of compounds which inhibit the proliferation of
bacteria, fungi, or protozoans. In further embodiments, these methods are employed to identify the 
targets of compounds which inhibit the growth of an organism selected from the group consisting of 
Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, 
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida 
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida 
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, 
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diptheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, 
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris, 
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori, 
Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia enterocolitica, 
Yersinia pestis or any species falling within the genera of any of the above species.

Overexpression may be obtained using a variety of techniques familiar to those skilled in
the art. For example, overexpression may be obtained by operably linking a gene encoding a gene 
product whose activity or level is inhibited by a nucleic acid comprising a nucleotide sequence 
selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 6214-
42397, a gene product comprising an amino acid sequence selected from the group consisting of
SEQ ID NOs.: 42398-78581, a gene product whose activity or level is inhibited by a homologous 
antisense nucleic acid, a gene product encoded by a homologous coding nucleic acid, or a gene 
product comprising a homologous polypeptide to a promoter which transcribes a higher level of 
10 mRNA encoding or comprising the gene product than does a wild type cell 

A variety of promoters may be used to overexpress the gene product described herein, 
including a gene product whose activity or level is inhibited by a nucleic acid comprising a 
nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product 
encoded by a nucleic acid comprising a nucleotide sequence selected from the group consisting of 
SEQ ID NOs.: 6214-42397, a gene product comprising an amino acid sequence selected from the
group consisting of SEQ ID NOs.: 42398-78581, a gene product whose activity or level is inhibited 
by a homologous antisense nucleic acid, a gene product encoded by a homologous coding nucleic 
acid, and a gene product comprising a homologous polypeptide. The promoters used to overexpress 
the gene product may be relatively strong promoters, promoters which possess a moderate level of 
activity, or relatively weak promoters and may be either constitutive or regulatable promoters. In
some embodiments, several strains, each of which overexpresses the gene product to a different 
extent, may be used in order to optimize the degree of overexpression of the gene product. 

In some embodiments, each of the gene products required for proliferation may be placed 
under the control of several different promoters of varying strengths to create several different 
strains which express the gene product at varying levels. The level of expression of the gene
product in each of the strains is compared to that in wild type cells in order to identify a promoter 
which provides a desired level of expression relative to wild type cells (i.e. a desired level of 
overexpression or underexpression). The strain having the desired level of expression is then 
included in a culture or collection of strains to be contacted with a test compound as discussed 
below. Examples of suites of regulatable promoters having varying strengths that are useful for the
expression of gene products at varying levels are described in U.S. Patent Application Serial
Number 10/032,393, filed on December 21, 2001.
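
By way of a non-limiting illustration, the comparison of strain expression levels to the wild type level described above could be sketched as follows. The sketch is in Python and assumes that an expression level has already been measured for each strain in arbitrary units. The names Strain, pick_strain, wild_type_level and target_ratio, and all numeric values, are hypothetical and are not taken from the cited application.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Strain:
    promoter: str      # hypothetical promoter label
    expression: float  # measured level of the gene product (arbitrary units)


def pick_strain(strains, wild_type_level, target_ratio):
    """Return the strain whose expression relative to wild type is closest
    to target_ratio (above 1 is overexpression, below 1 is underexpression)."""
    if wild_type_level <= 0:
        raise ValueError("wild_type_level must be positive")
    if not strains:
        raise ValueError("at least one strain is required")
    return min(
        strains,
        key=lambda s: abs(s.expression / wild_type_level - target_ratio),
    )


# Illustrative values only.
strains = [Strain("weak", 40.0), Strain("moderate", 210.0), Strain("strong", 900.0)]
best = pick_strain(strains, wild_type_level=100.0, target_ratio=2.0)
print(best.promoter)
```

In this sketch a target_ratio above 1 selects for overexpression and a value below 1 selects for underexpression; a real comparison would also need to account for measurement error, which is omitted here.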

The promoter is selected to be active in the type of cell in which the gene product is to be 
expressed. For example, for overexpression of the gene product in mammalian cells, the gene 
encoding the gene product may be operably linked to promoters such as the SV40 promoter, the
metallothionein promoter, the MMTV promoter, the RSV promoter, the tetP promoter, the
adenovirus major late promoter or other promoters known to those skilled in the art. In yeast, the 
gene encoding the gene product may be operably linked to promoters such as the CYC1, ADHI,
ADHII, GAL1, GAL10, PHO5, PGK or other promoters used in the art. Similarly, in bacteria, the
gene encoding the gene product may be operably linked to the , SP6, T3, trc promoter, lac 
promoter, temperature regulated lambda promoters, the Bacillus aprE and nprE promoters (U.S. 
Patent No. 5,387,521), the bacteriophage lambda PL and PR promoters (Renaut, et al., (1981) Gene
15: 81), the trp promoter (Russell, et al., (1982) Gene 20: 23), the tac promoter (de Boer et al.,
(1983) Proc. Natl. Acad. Sci. USA 80: 21), B. subtilis alkaline protease promoter (Stahl et al.,
(1984) J. Bacteriol. 158, 411-418), alpha amylase promoter of B. subtilis (Yang et al., (1983)
Nucleic Acids Res. 11, 237-249) or B. amyloliquefaciens (Tarkinen, et al., (1983) J. Biol. Chem.
258, 1007-1013), the neutral protease promoter from B. subtilis (Yang et al., (1984) J. Bacteriol.
160, 15-21), T7 RNA polymerase promoter (Studier and Moffatt (1986) J Mol Biol. 189(1):113-30),
B. subtilis xyl promoter or mutant tetR promoter active in bacilli (Geissendorfer & Hillen
(1990) Appl. Microbiol. Biotechnol. 33:657-663), Staphylococcal enterotoxin D promoter (Zhang
and Stewart (2000) J. Bacteriol. 182(8):2321-5), cap8 operon promoter from Staphylococcus aureus
(Ouyang et al., (1999) J. Bacteriol. 181(8):2492-500), the lactococcal nisA promoter (Eichenbaum
(1998) Appl Environ Microbiol. 64(8):2763-9), promoters from Acholeplasma laidlawii (Jarhede
et al., (1995) Microbiology 141 (Pt 9):2071-9), porA promoter of Neisseria meningitidis (Sawaya
et al., (1999) Gene 233:49-57), the fbpA promoter of Neisseria gonorrhoeae (Forng et al., (1997) J.
Bacteriol. 179:3047-3052), Corynebacterium diphtheriae toxin gene promoter (Schmitt and Holmes
(1994) J. Bacteriol. 176(4):1141-9), the hasA operon promoter from Group A Streptococci (Alberti

et al., (1998) Mol Microbiol 28(2):343-53), the rpoS promoter of Pseudomonas putida (Kojic and
Venturi (2001) J. Bacteriol. 183:3712-3720), the Acinetobacter baumannii phosphate regulated ppk
gene promoter (Gavigan et al., Microbiology 145:2931-7 (1999)); the Acinetobacter baumannii
adhC1 promoter which is induced under iron limitation and repressed when the cells are cultured in
the presence of free inorganic iron (Echenique et al., Microbiology 147:2805-15 (2001)); the flaB
promoter of pGK12 active in Borrelia burgdorferi (Sartakova et al., Proc Natl Acad Sci USA.
97(9):4850-5 (2000)); the Ptrc promoter, which gives strong inducer-dependent expression in
Burkholderia spp (Santos et al., FEMS Microbiol Lett 195(1):91-6 (2001)); the iron regulated sodA
promoter of Bordetella pertussis (Graeff-Wohlleben et al., J Bacteriol 179(7):2194-201 (1997));
UV-inducible bcn and uviAB promoters in Clostridia spp (Garnier and Cole Mol Microbiol
2(5):607-14 (1988)); the heat-inducible clpB promoter of Campylobacter jejuni (Thies et al., Gene
230(1):61-7 (1999)); promoters carrying bacteriophage CI operator sites in Klebsiella pneumoniae
(Schoefield et al., J Bacteriol 183(23):6947-50 (2001)); the Proteus mirabilis ureR promoter (Poore
et al., J Bacteriol 183(15):4526-35 (2001)); and the heat-inducible groESL promoter in Listeria
monocytogenes, and the IPTG inducible promoter in pLEX5BA (Krause et al., J. Mol. Biol. 274:
365 (1997)). In another embodiment, which may be useful in Staphylococcus aureus, the promoter
is a novel inducible promoter system, XylT5, comprising a modified T5 promoter fused to the xylO 
operator from the xylA promoter of Staphylococcus aureus. This promoter is described in U.S. 
Patent Application Serial Number 10/032,393. In another embodiment the promoter may be a
two-component inducible promoter system in which the T7 RNA polymerase gene is integrated on the
chromosome and is regulated by lacUV5/lacO (Brunschwig, E. and Darzins, A. 1992. Gene
111:35-41), and a T7 gene 10 promoter, which is transcribed by T7 RNA polymerase, is fused with a
lacO operator. In another embodiment the promoter may be the promoters from the plasmids
pEPEF3 or pEPEF14, which harbor xylose inducible promoters functional in E. faecalis, described
in U.S. Patent Application Serial No. 10/032,393. Other promoters which may be used are familiar
to those skilled in the art. In fungi, the gene encoding the gene product may be operably linked to
the CaACT1 promoter (Morschhauser, Mol. Gen. Genet. 257: 412-420 (1998)), or other promoters
familiar to those skilled in the art. It will be appreciated that other combinations of organisms and
promoters may also be used in the present invention.

In some embodiments, overexpression may be achieved by using homologous 
recombination to replace the natural promoter which drives expression of the proliferation-required 
genes described herein with a regulatable promoter. For example, the methods described in U.S. 
Patent Application 09/948,993 may be used to place the gene required for proliferation under the 
control of a regulatable promoter. Examples of gene products, which are encoded by genes that can
be overexpressed by regulatable promoters introduced by such promoter replacement methods 
include a gene product whose activity or level is inhibited by a nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a 
nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 
6214-42397, a gene product comprising an amino acid sequence selected from the group consisting
of SEQ ID NOs.: 42398-78581, a gene product whose activity or level is inhibited by a homologous 
antisense nucleic acid, a gene product encoded by a homologous coding nucleic acid, and a gene 
product comprising a homologous polypeptide. 

Briefly, in some embodiments of these methods in which natural promoters are replaced by 
regulatable promoters, the cells may be haploid, such as bacterial cells. Regulatable promoters that
are useful for promoter replacement in bacterial cells include, but are not limited to, the promoters 
described in U.S. Patent Application Serial Number 10/032,393 filed December 21, 2001. A linear 
promoter replacement cassette comprising a regulatable promoter flanked by nucleotide sequences 
having homology to the natural promoter is introduced into the cell. In some embodiments, the 
cassette also comprises a nucleotide sequence encoding a selectable marker or a marker whose
expression is readily identified. The cassette may be a double stranded nucleic acid or a single 
stranded nucleic acid as described in U.S. Patent Application Serial Number 09/948,993. Upon 
homologous recombination, the natural promoter is replaced with the regulatable promoter, leaving 
the gene required for proliferation under the control of the regulatable promoter. Strains in which 
the gene required for proliferation is under control of the regulatable promoter are grown under
conditions in which the regulatable promoter provides a level of the proliferation-required gene 
product which is above the level in a wild type cell. For example, the strains may be grown in the 
presence of an inducer which induces expression from the regulatable promoter, or under conditions
in which the action of a repressor on the regulatable promoter is reduced or eliminated. 
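
As a purely illustrative aid, the layout of a linear promoter replacement cassette and the outcome of the recombination event can be modeled on plain sequence strings. The Python sketch below is a toy model under stated assumptions: the homology arms are treated as exact matches flanking the natural promoter, the optional marker is placed 5' of the regulatable promoter, and every sequence and function name (build_cassette, replace_promoter) is hypothetical. It is not the procedure of the cited applications.

```python
VALID_BASES = set("ACGT")


def _check(seq):
    """Upper-case a sequence and reject anything that is not A, C, G or T."""
    seq = seq.upper()
    if set(seq) - VALID_BASES:
        raise ValueError("sequences may contain only A, C, G and T")
    return seq


def build_cassette(upstream_arm, promoter, downstream_arm, marker=""):
    """Join 5' arm, optional marker, regulatable promoter and 3' arm."""
    parts = (upstream_arm, marker, promoter, downstream_arm)
    return "".join(_check(p) for p in parts)


def replace_promoter(chromosome, upstream_arm, downstream_arm, cassette):
    """Model recombination: the region spanning both arms becomes the cassette."""
    start = chromosome.find(upstream_arm)
    if start < 0:
        raise ValueError("upstream homology arm not found")
    end = chromosome.find(downstream_arm, start + len(upstream_arm))
    if end < 0:
        raise ValueError("downstream homology arm not found after upstream arm")
    return chromosome[:start] + cassette + chromosome[end + len(downstream_arm):]


# Toy sequences: flank + 5' arm + natural promoter + 3' arm + start of gene.
chromosome = "GGGG" + "AAAC" + "TTTTTT" + "CCGA" + "ATGAAA"
cassette = build_cassette("AAAC", "GATC", "CCGA")
edited = replace_promoter(chromosome, "AAAC", "CCGA", cassette)
print(edited)
```

Real homology arms are far longer than the four-base strings used here, and recombination efficiency depends on the host, as discussed elsewhere in this description.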

Alternatively, rather than replacing the native promoters of each of the genes encoding a 
proliferation-required gene product described herein with a single desired replacement promoter, a 
plurality of replacement promoters which provide desired expression levels for the gene products to
be overexpressed or underexpressed are used. The method is performed as described above except
that rather than using a single labeled primer complementary to a nucleotide sequence within the 
single replacement promoter, a plurality of labeled primers complementary to suitable nucleotide 
sequences in the plurality of replacement promoters are used. 

Alternatively, in embodiments in which the level or activity of proliferation-required gene
products described herein is reduced by transcribing an antisense nucleic acid complementary to at
least a portion of the genes encoding such gene products, the strains may be designed such that the 
length of the nucleotide sequence encoding the antisense nucleic acid is different for each gene. 
Amplification reactions are performed as described above using primers at each end of the gene 
encoding the antisense nucleic acid such that the amplification product corresponding to each gene
has a unique length or a dye which allows it to be distinguished from other amplification products 
of the same length. Alternatively, the lengths of the nucleotide sequences encoding the antisense 
nucleic acids may not be unique for each gene, but the primers used in the amplification reaction 
may be selected such that the length of the amplification product corresponding to each gene is 
unique.
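
The requirement that each gene yield an amplification product of unique length lends itself to a simple bookkeeping check. The following Python sketch is offered only as an illustration: it assumes a table of expected product lengths per gene and a list of observed product lengths, and the names shared_lengths, identify_genes and tolerance, together with the gene labels and lengths, are hypothetical.

```python
from collections import Counter


def shared_lengths(expected_lengths):
    """Return product lengths assigned to more than one gene.
    An empty list means every gene has a unique product length."""
    counts = Counter(expected_lengths.values())
    return sorted(length for length, n in counts.items() if n > 1)


def identify_genes(expected_lengths, observed_lengths, tolerance=0):
    """Map each observed length to the single gene whose expected length lies
    within the tolerance, or to None if no gene or several genes match."""
    calls = {}
    for observed in observed_lengths:
        matches = [
            gene
            for gene, length in expected_lengths.items()
            if abs(length - observed) <= tolerance
        ]
        calls[observed] = matches[0] if len(matches) == 1 else None
    return calls


# Illustrative design: three genes, three distinct product lengths (in bases).
expected = {"geneA": 150, "geneB": 220, "geneC": 310}
print(shared_lengths(expected))
print(identify_genes(expected, [220, 151, 500], tolerance=2))
```

Where two products share a length and are distinguished by dye instead, the lookup key in such a sketch would need to be the pair of length and dye rather than length alone.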

In another embodiment, the native promoters may be replaced with promoters which 
include therein or adjacent thereto a unique nucleotide sequence which is distinct from that present 
in the other replacement promoters in the strains in the culture or collection of strains. In this 
embodiment, each promoter includes or has adjacent thereto a unique "tag" which may be used to
identify strains which proliferate more rapidly or more slowly in the culture or collection of strains.
The tag may be detected using hybridization based methods or amplification based methods, 
including the amplification method which generates amplification products having a unique size for 
each proliferation required gene described above. 
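
One way to picture how tag detection identifies strains that proliferate more rapidly or more slowly is to compare the share of each tag in two cultures. The Python sketch below is an assumption-laden illustration: it supposes that tag signals have been reduced to counts for a reference culture and a test culture, and the function names, thresholds and counts are hypothetical and not part of the described methods.

```python
def representation_ratios(control_counts, treated_counts):
    """For each tag, divide its fraction of the treated culture by its
    fraction of the control culture."""
    control_total = sum(control_counts.values())
    treated_total = sum(treated_counts.values())
    if control_total == 0 or treated_total == 0:
        raise ValueError("both cultures need at least one counted tag")
    ratios = {}
    for tag, control in control_counts.items():
        if control == 0:
            continue  # tag absent from the control, so the ratio is undefined
        treated_fraction = treated_counts.get(tag, 0) / treated_total
        ratios[tag] = treated_fraction / (control / control_total)
    return ratios


def classify(ratios, low=0.5, high=1.5):
    """Label each tag by comparing its ratio to illustrative thresholds."""
    calls = {}
    for tag, ratio in ratios.items():
        if ratio < low:
            calls[tag] = "underrepresented"
        elif ratio > high:
            calls[tag] = "overrepresented"
        else:
            calls[tag] = "unchanged"
    return calls


# Illustrative counts only.
control = {"tagA": 100, "tagB": 100, "tagC": 100}
treated = {"tagA": 10, "tagB": 100, "tagC": 190}
calls = classify(representation_ratios(control, treated))
print(calls)
```

The thresholds of 0.5 and 1.5 are arbitrary placeholders; suitable cut-offs would depend on the detection method and its noise.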

Alternatively, the native promoter which directs the transcription of the
proliferation-required genes described herein may be rendered regulatable by inserting a regulatory
element into the
chromosome of the cell via homologous recombination such that the regulatory element regulates 
the level of transcription from the promoter. Examples of gene products, which are encoded by 
genes that have promoters which can be rendered regulatable by regulatory elements inserted by 
such methods include a gene product whose activity or level is inhibited by a nucleic acid
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a
gene product encoded by a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 6214-42397, a gene product comprising an amino acid sequence 
selected from the group consisting of SEQ ID NOs.: 42398-78581, a gene product whose activity or 
level is inhibited by a homologous antisense nucleic acid, a gene product encoded by a homologous
coding nucleic acid, and a gene product comprising a homologous polypeptide. 

A variety of regulatory elements may be used to regulate the expression of essential gene 
products described herein. The regulatory element may be an operator which is recognized by a 
repressor (e.g. lac, tet, araBAD repressors) or a nucleotide sequence which is recognized by a
transcriptional activator. In some embodiments, the regulatory element may be a transcriptional 
terminator, a nucleotide sequence which introduces a bend in the DNA or an upstream activating 
sequence. A linear regulatory element insertion cassette comprising a regulatory element flanked 
by nucleotide sequences having homology to the natural promoter is introduced into the cell. In 
some embodiments, the cassette also comprises a nucleotide sequence encoding a selectable marker
or a marker whose expression is readily identified. The cassette may be a double stranded nucleic 
acid or a single stranded nucleic acid as described in U.S. Patent Application Serial Number 
09/948,993. Upon homologous recombination, the regulatory element is inserted into the 
chromosome, leaving the gene required for proliferation under the control of the regulatory 
element. Strains in which the gene required for proliferation is under control of the regulatory
element are grown under conditions in which the regulatable promoter provides a level of the 
proliferation-required gene product which is above the level in a wild type cell. For example, the 
strains may be grown in the presence of an inducer which induces expression from the promoter, or 
under conditions in which the action of a repressor on the promoter is reduced or eliminated. It will 
be appreciated that the amplification method which generates amplification products having a
unique size for each proliferation required gene may be used to detect strains which are 
overrepresented or underrepresented in the culture or collection of strains. For example, if desired, 
primers complementary to a nucleotide sequence within the regulatory element may be used in the 
amplification reaction. 

The promoter replacement cassette or regulatory element insertion cassette may be a double
stranded nucleic acid, such as an amplicon generated through PCR or other amplification methods,
or a single stranded nucleic acid, such as an oligonucleotide. For example, single stranded nucleic 
acids may be introduced into the chromosome using the methods described in Ellis et al., PNAS 98: 
6742-6746, 2001. 

In some embodiments, the cell into which the promoter replacement cassette or regulatory
element insertion cassette is introduced has an enhanced frequency of recombination. For example,
the cells may lack or have a reduced level or activity of one or more exonucleases which would 
ordinarily degrade the DNA to be inserted into the chromosome. In further embodiments, the cells 
may both lack or have reduced levels of exonucleases and express or overexpress proteins involved
in mediating homologous recombination. For example, if the methods are performed in
Escherichia coli or other enteric prokaryotes, cells in which the activity of exonuclease V of the 
RecBCD recombination pathway, which degrades linear nucleic acids, has been reduced or 
eliminated, such as recB, recC, or recD mutants, may be used. In some embodiments, the cells have
mutations in more than one of the recB, recC, and recD genes which enhance the frequency of
homologous recombination. For example the cells may have mutations in both the recB and recC 
genes. 

The promoter replacement or regulatory element insertion methods may also be performed
in Escherichia coli cells in which the activity of the RecET recombinase system of the Rac
prophage has been activated, such as cells which carry an sbcA mutation. The RecE gene of the Rac
prophage encodes ExoVIII, a 5'-3' exonuclease, while the RecT gene of the Rac prophage encodes a
single stranded DNA binding protein which facilitates renaturation and D-loop formation. Thus,
the gene products of the RecE and RecT genes or proteins with analogous functions facilitate
homologous recombination. The RecE and RecT genes lie in the same operon but are normally not
expressed. However, sbcA mutations activate the expression of the RecE and RecT genes. In some
embodiments, the methods may be performed in cells which carry mutations in the recB and recC
genes as well as the sbcA mutation. The RecE and RecT genes may be constitutively or
conditionally expressed. For example, the methods may be performed in E. coli strain JC8679,
which carries the sbcA23, recB21 and recC22 mutations.

In some embodiments, the methods may be performed in Escherichia coli cells in which 
recombination via the RecF pathway has been enhanced, such as cells which carry an sbcB 
mutation. 

It will be appreciated that the RecE and RecT gene products, or proteins with analogous
functions may be conditionally or constitutively expressed in prokaryotic organisms other than E.
coli. In some embodiments, these proteins may be conditionally or constitutively expressed in
Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis,
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diphtheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella choleraesuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia enterocolitica,
Yersinia pestis or any species falling within the genera of any of the above species. For example,
plasmids encoding these gene products may be introduced into the organism. If desired, the coding
sequences encoding these gene products may be optimized to reflect the codon preferences of the
organism in which they are to be expressed. Similarly, in some embodiments, the organism may
contain mutations analogous to the recB, recC, recD, sbcA or sbcB mutations which enhance the
frequency of homologous recombination.
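
The codon optimization mentioned above can be illustrated with a minimal back-translation sketch in Python. It assumes a table giving one preferred codon per amino acid for the host organism. The table shown is a small placeholder covering only a few residues and is not measured codon usage for any organism, and the names PREFERRED_CODONS and back_translate are hypothetical.

```python
# Placeholder preference table: one codon per residue, "*" marks the stop.
# These choices are illustrative and would be replaced by host-specific data.
PREFERRED_CODONS = {
    "M": "ATG",
    "K": "AAA",
    "L": "CTG",
    "S": "AGC",
    "E": "GAA",
    "*": "TAA",
}


def back_translate(protein, preferred=PREFERRED_CODONS):
    """Encode a protein sequence using the preferred codon for each residue."""
    codons = []
    for residue in protein.upper():
        if residue not in preferred:
            raise ValueError(f"no preferred codon listed for residue {residue!r}")
        codons.append(preferred[residue])
    return "".join(codons)


optimized = back_translate("MKLSE*")
print(optimized)
```

A practical design would use a complete table for all twenty amino acids and would typically also screen the result for unwanted features such as internal restriction sites, which this sketch does not attempt.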

In further embodiments, the promoter replacement or regulatory element insertion methods
may be conducted in cells which utilize the Red system of bacteriophage lambda (λ) or analogous
systems from other phages to enhance the frequency of homologous recombination. The Red
system contains three genes, γ, β and exo, whose products are the Gam, Bet and Exo proteins (see
Ellis et al. PNAS 98:6742-6746, 2001). The Gam protein inhibits the RecBCD exonuclease V, thus
permitting Beta and Exo to gain access to the ends of the DNA to be integrated and facilitating
homologous recombination. The Beta protein is a single stranded DNA binding protein that
promotes the annealing of a single stranded nucleic acid to a complementary single stranded nucleic
acid and mediates strand exchange. The Exo protein is a double-stranded DNA dependent 5'-3'
exonuclease that leaves 3' overhangs that can act as substrates for recombination. Thus,
constitutive or conditional expression of the λ Red proteins or proteins having analogous functions
facilitates homologous recombination.

It will be appreciated that the λ Beta, Gam and Exo proteins, or proteins with analogous
functions may be expressed constitutively or conditionally in prokaryotic organisms other than E.
coli. In some embodiments, these proteins may be conditionally or constitutively expressed in
Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis,
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diphtheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella choleraesuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia enterocolitica,
Yersinia pestis or any species falling within the genera of any of the above species. For example,
plasmids encoding these gene products may be introduced into the organism. If desired, the coding
sequences encoding these gene products may be optimized to reflect the codon preferences of the
organism in which they are to be expressed.

In some embodiments, the cells may have an increased frequency of homologous 
recombination as a result of more than one of the aforementioned characteristics. In some 
embodiments, the enhanced frequency of recombination may be a conditional characteristic of the 
cells which depends on the culture conditions in which the cells are grown. For example, in some 
embodiments, expression of the λ Red Gam, Exo, and Beta proteins or recE and recT proteins may
be regulated. Thus, the cells may have an increased frequency of homologous recombination as a 
result of any combination of the aforementioned characteristics. For example, in some 
embodiments, the cell may carry the sbcA and recBC mutations. 

In some embodiments, a linear double stranded DNA to be inserted into the chromosome of
the organism is introduced into an organism constitutively or conditionally expressing the recE and
recT or the λ Beta, Gam and Exo proteins or proteins with analogous functions as described above.
In some embodiments, the organism may be Acinetobacter baumannii, Anaplasma marginale,
Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia
burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter
jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis,
Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida
pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis,
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Clostridium perfringens,
Coccidioides immitis, Corynebacterium diphtheriae, Cryptococcus neoformans, Enterobacter
cloacae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae,
Helicobacter pylori, Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila,
Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella choleraesuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia enterocolitica,
Yersinia pestis or any species falling within the genera of any of the above species. In some
embodiments, the double stranded DNA may be introduced into an organism having the recBC and
sbcA mutations or analogous mutations.

In other embodiments, a single stranded DNA to be inserted into the chromosome of the
organism is introduced into an organism expressing the λ Beta protein or a protein with an
analogous function. In some embodiments the single stranded DNA is introduced into an organism
expressing both the λ Beta and Gam proteins or proteins with analogous functions. In further
embodiments, the single stranded DNA is introduced into an organism expressing the λ Beta, Gam
and Exo proteins or proteins with analogous functions. The λ proteins or analogous proteins may
be expressed constitutively or conditionally. In some embodiments, the organism may be
Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis,
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diphtheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella choleraesuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia enterocolitica,
Yersinia pestis or any species falling within the genera of any of the above species.

In some embodiments, the linear nucleic acid may be introduced into the chromosome of a
first organism which has an enhanced frequency of homologous recombination and then transferred
to a second organism which is less amenable to direct application of the present methods. For 
example, the linear nucleic acid may be introduced into the chromosome of E. coli and transferred 
into a second organism via conjugation or transduction. After introduction into the second
organism, the nucleic acid is inserted into the chromosome of the second organism via homologous 
recombination, thereby effectively transferring the regulatory element from the chromosome of the 
first organism into the corresponding location in the chromosome of the second organism. 

In other embodiments, the cells may be diploid cells, such as fungal cells. In some
embodiments, one copy of the gene encoding the proliferation-required gene product may be
disrupted, rendering it inactive. In further embodiments, one copy of the gene encoding the
proliferation-required gene product may be disrupted and the other copy of the gene encoding the
proliferation-required gene product may be placed under the control of a regulatable promoter.
Such strains may be generated by disrupting the first copy of the gene encoding the
proliferation-required gene product by homologous recombination using a disruption cassette
comprising a nucleotide sequence encoding an expressible dominant selectable marker flanked on
each side by nucleic acids homologous to the target sequence to be disrupted. The second copy of
the gene encoding the proliferation-required gene product may be placed under the control of a
regulatable promoter by homologous recombination using a promoter replacement cassette
comprising a regulatable promoter flanked on each side by nucleic acids homologous to the natural
promoter for the proliferation-required gene. The promoter replacement cassette may also include a
nucleotide sequence encoding a selectable marker located 5' of the regulatable promoter but
between the nucleic acids homologous to the natural promoter.

In other embodiments, overexpression may be achieved by operably linking a 
proliferation-required gene product described herein to a desired promoter in a vector. The vector may be a 
vector which replicates extrachromosomally or a vector which integrates into the chromosome. For 
example, if the vector is to be used in bacterial cells, the vector may be a pBR322 based vector or a 
bacteriophage based vector such as P1 or lambda. If the vector is to be used in Saccharomyces 
cerevisiae, it may be a vector based on the 2 micron circle or a vector incorporating a yeast 
chromosomal origin of replication. If the vector is to be used in mammalian cells, it may be a 
retroviral vector, SV40 based vector, a vector based on bovine papilloma virus, a vector based on 
adenovirus, or a vector based on adeno-associated virus. If the vector is to be used in Candida 
albicans it may be a vector comprising a promoter selected from the group consisting of the 
CaPCK1, MET25, MAL2, PHO5, GAL1,10, STE2 or STE3 promoters. In some embodiments, the 
vectors described in the following publications may be used: CIp10, an efficient and convenient 
integrating vector for Candida albicans, Murad et al., Yeast 16(4):325-7 (2000); transforming 
vector pCPW7, Kvaal et al., Infect Immun 67(12):6652-62 (1999); transforming vector 
pCWOP16, Kvaal et al., Infect Immun 65(11):4668-75 (1997); double-ARS vector pRM1, to be 
used for direct cloning in C. albicans by complementation of the histidine auxotrophy of strain CA9, 
Pla et al., Gene 165(1):115-20 (1995); pMK16, which was developed for the transformation of 
C. albicans and carries an ADE2 gene marker and a Candida autonomously replicating sequence (CARS) 
element promoting autonomous replication (cited in Sanglard and Fiechter, Yeast 8(12):1065-75 
(1992)); plasmid vector pRC2312, which replicates autonomously in Escherichia coli, 
Saccharomyces cerevisiae and Candida albicans and contains LEU2, URA3 and an 
autonomously replicating sequence (ARS) from C. albicans, Cannon et al., Mol Gen Genet 
235(2-3):453-7 (1992); expression vector CIp10-MAL2p for use in Candida albicans, in which 
a gene of interest can be placed under the control of the CaMAL2 maltase 
promoter and stably integrated at the CaRP10 locus, Backen et al., Yeast 16(12):1121-9 (2000); 
(Volker, R. S., A. Sonneborn, C. E. Leuker, and J. F. Ernst. 1997. Efg1p, an essential regulator of 
morphogenesis of the human pathogen Candida albicans, is a member of a conserved class of 
bHLH proteins regulating morphogenetic processes in fungi. EMBO 16:1982-1991.); and a C. 
albicans transformation vector containing the C. albicans URA3 gene, a Candida ARS sequence, 
and a portion of the Saccharomyces cerevisiae 2 micron circle containing the replication origin, 
Goshorn et al., Infect Immun 60(3):876-84 (1992). A variety of other vectors 
suitable for use in the foregoing organisms or in any other organism in which the present invention is to 
be practiced are familiar to those skilled in the art. 

Underexpression of a proliferation-required gene product described herein may be obtained 
in a variety of ways. For example, in one embodiment underexpression of the 
proliferation-required gene product may be achieved by providing an agent, such as an antisense nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, an 
antisense nucleic acid comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, 
or 500 consecutive nucleotides of a nucleotide sequence selected from the group consisting of SEQ 
ID NOs.: 1-6213, a nucleic acid complementary to a nucleic acid comprising a nucleotide sequence 
selected from the group consisting of SEQ ID NOs.: 6214-42397, a nucleic acid complementary to 
a nucleic acid comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 
consecutive nucleotides of a nucleotide sequence selected from the group consisting of SEQ ID 
NOs.: 6214-42397, a nucleic acid complementary to a nucleic acid which encodes a polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID NOs.: 
42398-78581, a nucleic acid complementary to a nucleic acid which encodes at least 5, 10, 15, 20, 25, 30, 
35, 40, 50, 75, 100, or 150 consecutive amino acids of a polypeptide sequence selected from the 
group consisting of SEQ ID NOs.: 42398-78581, a homologous antisense nucleic acid, an antisense 
nucleic acid comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 
consecutive nucleotides of a homologous nucleic acid, a nucleic acid complementary to a 
homologous coding nucleic acid, a nucleic acid complementary to at least 10, 15, 20, 25, 30, 35, 40, 
50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides of a homologous coding nucleic acid, 
a nucleic acid complementary to a nucleic acid which encodes a homologous polypeptide, or a 
nucleic acid complementary to a nucleic acid which encodes at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 
75, 100, or 150 consecutive amino acids of a homologous polypeptide, which reduces the level or 
activity of the gene product within the cell. In one embodiment, the agent may comprise an 
antisense nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ 
ID NOs.: 1-6213 which is complementary to a nucleic acid encoding the proliferation-required gene 
product or complementary to a portion of a nucleic acid encoding the proliferation-required gene 
product. 
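
By way of illustration only, the relationship between a coding nucleic acid and an antisense nucleic acid complementary to a stretch of consecutive nucleotides thereof may be sketched in Python as follows. The function names `reverse_complement` and `antisense_fragment` and the 30-nucleotide example sequence are hypothetical and are not taken from the Sequence Listing; the sketch assumes unambiguous DNA bases only.

```python
# Watson-Crick complements for unambiguous DNA bases.
_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    seq = seq.upper()
    try:
        return "".join(_COMPLEMENT[base] for base in reversed(seq))
    except KeyError as exc:
        raise ValueError(f"unsupported base: {exc.args[0]!r}") from None


def antisense_fragment(coding: str, start: int, length: int) -> str:
    """Antisense DNA complementary to `length` consecutive nucleotides of
    `coding`, beginning at 0-based position `start`."""
    if length < 1 or start < 0 or start + length > len(coding):
        raise ValueError("window falls outside the coding sequence")
    return reverse_complement(coding[start:start + length])


# Hypothetical 30-nt coding-strand excerpt (illustrative only).
coding = "ATGGCTAAACGTCTGGAAGATCTGCGTAAA"
fragment = antisense_fragment(coding, start=3, length=15)
```

Taking the reverse complement of `fragment` recovers the 15 consecutive nucleotides of `coding` from which it was derived, which is the complementarity on which the recited antisense fragments rely.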

In one example of antisense-inhibition-based underexpression, a nucleic acid which 
encodes the antisense nucleic acid may be operably linked to a regulatable promoter. When grown 
under appropriate conditions, such as media containing an inducer of transcription or an agent 
which alleviates repression of transcription, the antisense nucleic acid is expressed in the cell, 
thereby reducing the level or activity of the gene product within the cell. In some embodiments, the 
concentration of the inducer of transcription or the agent which alleviates repression of transcription 
may be varied to provide optimal results. Such methods have been described previously herein and 
in U.S. Patent Application Serial Number 09/815,242, U.S. Patent Application Serial Number 
09/492,709, U.S. Patent Application Serial Number 09/711,164, or U.S. Patent Application Serial 
Number 09/741,669. 

Alternatively, underexpression of a proliferation-required gene product described herein 
may be achieved by constructing strains in which the expression of the gene product is under the 
control of a constitutive or regulatable promoter using methods such as those described above with 
respect to methods in which the gene product is overexpressed. To provide cells which 
underexpress the gene product, the cells are grown under conditions in which the gene product is 
expressed at a level lower than that of a wild type cell. For example, the cells may be grown under 
conditions in which a repressor reduces the level of transcription from the regulatable promoter. 

In other embodiments, underexpression may be achieved by operably linking the gene 
required for proliferation to a desired promoter in a vector as described above with respect to 
embodiments in which gene products required for proliferation are overexpressed. In some 
embodiments, the vector may be present in cells in which the chromosomal copy or copies of the 
gene have been disrupted. 

Examples of gene products encoded by genes that can be underexpressed using 
methods such as those described above with respect to methods in which the gene product is 
overexpressed include a gene product whose activity or level is inhibited by a nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 1-6213, a 
gene product encoded by a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOs.: 6214-42397, a gene product comprising an amino acid sequence 
selected from the group consisting of SEQ ID NOs.: 42398-78581, a gene product whose activity or 
level is inhibited by a homologous antisense nucleic acid, a gene product encoded by a homologous 
coding nucleic acid, and a gene product comprising a homologous polypeptide. 

One embodiment of the invention includes a method for identifying a gene product 
described herein on which a compound which inhibits the proliferation of an organism acts. The 
method employs a culture which comprises a mixture of strains of the organism. At least some of 
the strains in the culture overexpress a different gene product which is required for the proliferation 
of the organism. Preferably, each of the strains in the culture overexpresses a different gene 
product which is required for proliferation of the organism (i.e. all of the strains in the culture 
overexpress a gene product which is required for proliferation of the organism). For example, the 
gene product which is overexpressed in each strain may be a gene product whose activity or level is 
inhibited by a nucleic acid comprising a nucleotide sequence selected from the group consisting of 
SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic acid comprising a nucleotide sequence 
selected from the group consisting of SEQ ID NOs.: 6214-42397, a gene product comprising an 
amino acid sequence selected from the group consisting of SEQ ID NOs.: 42398-78581, a gene 
product whose activity or level is inhibited by a homologous antisense nucleic acid, a gene product 
encoded by a homologous coding nucleic acid, and a gene product comprising a homologous 
polypeptide. 

Strains that overexpress the proliferation-required gene product may be obtained using the 
methods described above. The culture may comprise any number of strains which overexpress a 
gene product required for proliferation. For example, the culture may comprise at least two strains, 
at least 10 strains, at least 20 strains, at least 30 strains, at least 50 strains, at least 100 strains, at 
least 300 strains or more than 300 strains which overexpress a gene product required for 
proliferation. In some embodiments, the culture may comprise strains which in aggregate 
overexpress all or most of the gene products required for proliferation of the organism. 

WO 02/077183 PCT/US02/09107 

The culture is contacted with a compound which inhibits proliferation of the organism. The 
compound may be a candidate drug compound obtained from any source. For example, the 
compound may be a compound generated using combinatorial chemistry, a compound from a 
natural product library, or an impure or partially purified compound, such as a compound in a 
partially purified natural extract. The culture is contacted with a sufficient concentration of the 
compound to inhibit the proliferation of strains of the organism in the culture which do not 
overexpress the gene product on which the compound acts, such that strains which overexpress said 
gene product on which the compound acts proliferate more rapidly in the culture than strains which 
do not overexpress said gene product on which said compound acts. Thus, after a sufficient period 
of time, the strain which overexpresses the gene product on which the compound acts will be more 
prevalent in the culture than strains which do not overexpress the gene product on which the 
compound acts. In a preferred embodiment, the growth conditions and incubation period are 
selected so that only one strain, the strain overexpressing the target of the compound, is recovered 
from the culture. Thus, in one embodiment, a plurality of cultures containing a plurality of strains 
each of which overexpresses a different proliferation-required gene product may be grown in the 
presence of varying concentrations of the compound. In addition to varying the compound 
concentrations, in embodiments where expression of the proliferation-required gene product is 
under the control of a regulatable promoter, the plurality of cultures may be grown at varying 
concentrations of an agent which regulates the level of expression from the promoter, such as an 
inducer or an agent which reduces the effect of a repressor on transcription from the promoter. It 
will be appreciated that the cultures may be grown in liquid medium in the presence of the 
compound whose target is to be identified (and where appropriate in the presence of an agent which 
regulates the level of expression from the promoter) or alternatively, a liquid culture comprising the 
strains which overexpress the proliferation-required gene products may be grown in the absence of 
the compound whose target is to be identified and then introduced onto a solid medium containing 
the compound (and, where appropriate, also containing an agent which regulates the level of 
expression from the promoter). 
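
The enrichment described above may be illustrated with a deliberately simplified numerical sketch in Python. It assumes idealized exponential growth from equal starting abundances, with hypothetical growth rates for inhibited strains and for the strain rescued by overexpression of the target; the function name `simulate_pooled_growth`, the rates, and the strain labels are illustrative assumptions rather than measured values.

```python
def simulate_pooled_growth(strains, target, doubling_periods,
                           inhibited_rate=0.2, rescued_rate=1.0):
    """Return the fraction of the pool held by each strain after growth.

    strains          -- names of the overexpressing strains (equal at start)
    target           -- strain overexpressing the compound's target
    doubling_periods -- elapsed time, in doubling times of the rescued strain
    inhibited_rate   -- doublings per period for inhibited strains (assumed)
    rescued_rate     -- doublings per period for the rescued strain (assumed)
    """
    abundance = {}
    for name in strains:
        rate = rescued_rate if name == target else inhibited_rate
        abundance[name] = 2.0 ** (rate * doubling_periods)
    total = sum(abundance.values())
    return {name: count / total for name, count in abundance.items()}


# Hypothetical pool of 100 strains, one of which overexpresses the target.
pool = [f"strain_{i:03d}" for i in range(100)]
fractions = simulate_pooled_growth(pool, target="strain_042",
                                   doubling_periods=20)
most_prevalent = max(fractions, key=fractions.get)
```

Under these assumed rates the rescued strain accounts for more than 99% of the pool after 20 doubling periods, which is why a sufficiently long incubation at a suitable compound concentration may leave a single recoverable strain.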

The identity of the overexpressed gene product which is the target of the compound may be 
determined using a variety of methods. For example, in some embodiments of the present 
invention, the nucleic acids present in the culture or collection of strains which was contacted with 
the compound may be compared to the nucleic acids present in a control culture or collection of 
strains which was not contacted with the compound to identify nucleic acids which are 
overrepresented in the culture or collection of strains contacted with the test compound relative to 
the control culture or collection of strains. Alternatively, in some embodiments, the nucleic acids 
present in a culture or collection of strains contacted with the test compound may be analyzed to 
identify those nucleic acids which are present without comparison to a control culture or collection 
of strains. 

In some embodiments, the strains which proliferated more rapidly in the culture or 
collection of strains, i.e. strains having an enhanced ability to proliferate in the presence of a test 
compound relative to other strains in the culture or collection of strains, are identified as follows. 
Amplification products which are correlated with each of the overexpressed genes and which are 
distinguishable from one another are obtained from a culture or collection grown in the presence of 
a test compound. The amplification products are distinguished from one another to determine 
whether a particular amplification product is overrepresented in the culture or collection of strains. 
In some embodiments, the amplification products corresponding to each of the gene products have 
lengths which permit them to be distinguished from one another. In another embodiment, one or 
more of the amplification products have similar or identical lengths but are distinguishable from 
one another based on a detectable agent, such as a dye, attached thereto. In some embodiments, 
amplification products which are overrepresented are identified by comparing the amplification 
products from the culture or collection of strains which was contacted with the test compound to the 
amplification products from a culture or collection of strains which was not contacted with the test 
compound. Alternatively, amplification products which are overrepresented may be identified by 
simply identifying the amplification products obtained from the culture or collection of strains 
contacted with the test compound (for example, only one or a few strains may have proliferated in 
the presence of the test compound). The above methods for generating distinguishable 
amplification products may be used in conjunction with any of the methods for generating strains 
which overexpress gene products required for proliferation described herein in order to facilitate the 
identification of strains which proliferate more rapidly or more slowly in the presence of a test 
compound. 

For example, in some embodiments of the present invention, the native promoter 
of each of the genes encoding gene products required for proliferation is replaced by a single 
desired replacement promoter. After growth of the culture or collection of strains containing the 
strains in which the promoters have been replaced in the presence of a test compound for a desired 
period of time, an amplification reaction is performed on nucleic acids obtained from the culture as 
follows. 

The nucleic acids from the culture or collection of strains may be divided into at least two 
aliquots if desired. In a preferred embodiment the nucleic acids from the culture or collection of 
strains are divided into four aliquots. A single primer complementary to a nucleotide sequence 
within the replacement promoter, within the proliferation-required genes, or within nucleic acid 
sequences adjacent to the promoter or proliferation-required genes is divided into at least two 
portions, one portion for each aliquot of nucleic acids. Each portion of the primer is labeled with a 
distinct detectable dye, such as the 6FAM™, TET™, VIC™, HEX™, NED™, and PET™ dyes 
obtainable from Applied Biosystems (Foster City, CA). For example, the DS-31 or DS-33 dye sets 
available from Applied Biosystems (Foster City, CA) may be used to label the primers. 
Alternatively, the HEX™, NED, JOE, TMR and TET™ dyes available from Amersham 
Biosciences may be used. Thus, if the nucleic acids from the culture are not divided into aliquots, a 
single primer labeled with a single dye may be used. If the nucleic acids from the culture are 
divided into aliquots, at least 2, at least 3, at least 4 or more than 4 primers labeled with 
distinguishable dyes may be used. The portions of labeled primer are added to the 
aliquots of the nucleic acids from the culture or collection of strains such that each aliquot of 
nucleic acid receives a single labeled primer with a single detectable dye thereon. In some 
embodiments, the primers are divided into 3 portions, 4 portions or more than 4 portions, with each 
portion having thereon a dye which is distinguishable from the dyes on the other portions. 

Each of the aliquots of nucleic acids also receives a set of unlabeled primers, with each of 
the unlabeled primers being complementary to a nucleotide sequence within the promoter, within a 
nucleotide sequence which is unique to one of the genes encoding gene products required for 
proliferation which were placed under the control of the replacement promoter, or within nucleotide 
sequences adjacent to the promoter or proliferation-required genes. Each of the aliquots receives 
primers unique to 1/N of the proliferation-required genes which were placed under the control of the 
replacement promoter, where N is the number of aliquots (i.e. if the culture or collection of strains 
consisted of 100 strains in which a gene required for proliferation was placed under the control of 
the replacement promoter and was divided into four aliquots, then each of the four aliquots of 
nucleic acids from the culture or collection of strains would receive primers complementary to 25 
of the genes). The unlabeled primers are selected so that each will yield an amplification product 
having a length distinguishable from the length of the amplification product produced with the other 
unlabeled primers. Preferably, the amplification products are between about 100 and about 400 
nucleotides in length, but any lengths which may be distinguished from each other may be used. In 
addition, in some of the embodiments some of the amplification products may have identical or 
very similar lengths but be distinguishable from one another due to labeling with distinguishable 
dyes. 
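
The bookkeeping implied by the foregoing scheme (N aliquots, one dye per aliquot, 1/N of the genes per aliquot, and a distinguishable product length for each gene within an aliquot) may be sketched in Python as follows. The function `assign_aliquots`, the gene labels, and the even spacing of target lengths across the about 100 to about 400 nucleotide window are illustrative assumptions; actual primer placement would depend on the sequences involved.

```python
def assign_aliquots(genes, n_aliquots, dyes, min_len=100, max_len=400):
    """Assign each gene an aliquot, the dye of that aliquot's labeled primer,
    and a target amplicon length that is unique within the aliquot."""
    if n_aliquots < 1 or n_aliquots > len(dyes):
        raise ValueError("need one distinguishable dye per aliquot")
    per_aliquot = -(-len(genes) // n_aliquots)  # ceiling division
    if per_aliquot > max_len - min_len + 1:
        raise ValueError("too many genes per aliquot for the length window")
    # Spread target lengths evenly across the window so peaks are separated.
    step = (max_len - min_len) // max(per_aliquot - 1, 1)
    plan = {}
    for index, gene in enumerate(genes):
        aliquot = index % n_aliquots   # round-robin over the aliquots
        slot = index // n_aliquots     # position of the gene in its aliquot
        plan[gene] = {
            "aliquot": aliquot,
            "dye": dyes[aliquot],
            "amplicon_length": min_len + slot * step,
        }
    return plan


# 100 hypothetical genes divided among four aliquots, as in the example above.
genes = [f"gene_{i:03d}" for i in range(100)]
plan = assign_aliquots(genes, n_aliquots=4,
                       dyes=["6FAM", "VIC", "NED", "PET"])
```

With 100 genes and four aliquots, each aliquot receives primers for 25 genes, and every gene is identified by a unique combination of dye and product length, so that the pooled products remain distinguishable even where lengths coincide across aliquots.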

A nucleic acid amplification reaction is conducted on each of the nucleic acid aliquots. The 
amplification products are then separated by length to identify amplification products having 
increased representation in the culture or collection of strains (i.e. amplification products derived 
from cells which proliferated more rapidly in the culture or collection of strains). The amplification 
products are then correlated with the corresponding genes to determine which strains proliferated 
more rapidly in the culture or collection of strains. If desired, amplification products having 
increased representation in the culture may be identified by comparing the amplification products 
obtained from a culture or collection of strains which was contacted with the compound to 
amplification products obtained from a control culture or collection of strains which was not 
contacted with the compound. Alternatively, if desired, the amplification products which are 
obtained from a culture which was contacted with the compound may be directly identified without 
comparison to a control culture which was not contacted with the compound. 
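
One way to make the comparison between a contacted culture and a control culture concrete is sketched below in Python. The sketch assumes that each amplification product has already been reduced to a peak signal keyed by its length, normalizes each culture to its total signal, and reports products whose share has increased by an assumed fold threshold. The function name `overrepresented_products`, the threshold, and all peak values are hypothetical.

```python
def overrepresented_products(treated, control, fold_threshold=2.0):
    """Compare peak signals keyed by amplicon length (nt).

    Signals are converted to fractions of the total signal in each culture
    so that differences in overall yield do not dominate. Returns
    {length: fold_change} for products enriched in the treated culture.
    """
    treated_total = sum(treated.values())
    control_total = sum(control.values())
    if treated_total <= 0 or control_total <= 0:
        return {}
    enriched = {}
    for length, signal in treated.items():
        baseline = control.get(length, 0.0)
        if baseline <= 0:
            continue  # no control peak; fold change is undefined
        fold = (signal / treated_total) / (baseline / control_total)
        if fold >= fold_threshold:
            enriched[length] = fold
    return enriched


# Hypothetical peak areas (arbitrary units) keyed by amplicon length.
control = {100: 50.0, 112: 50.0, 124: 50.0, 136: 50.0}
treated = {100: 5.0, 112: 5.0, 124: 85.0, 136: 5.0}
hits = overrepresented_products(treated, control)

# Correlate the enriched lengths with the corresponding genes.
length_to_gene = {100: "gene_A", 112: "gene_B", 124: "gene_C", 136: "gene_D"}
candidates = [length_to_gene[length] for length in hits]
```

In this invented data set only the 124-nucleotide product is enriched, so `candidates` contains only the gene assigned to that length.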

For example, in some embodiments, the amplification products from each of the nucleic 
acid aliquots are pooled and subjected to capillary electrophoresis. The amplification products are 
detected by detecting the fluorescent dyes attached thereto and their lengths are determined to 
identify those amplification products having increased or decreased representation in the culture or 
collection of strains. Figures 2A and 2B illustrate one embodiment of this method in which the 
absence of an amplification product from an amplification reaction performed on a culture 
comprising a plurality of strains underexpressing genes required for proliferation indicates that a 
test compound acts on the gene corresponding to the missing amplification product. It will be 
appreciated that the method may also be used to identify an amplification product which is 
overrepresented in an amplification reaction conducted on a culture or collection of strains 
overexpressing genes required for proliferation because the test compound acted on the 
corresponding gene. 

Alternatively, in another embodiment, a first amplification reaction is performed on nucleic 
acids obtained from a culture or collection of strains which was contacted with the compound using 
a first primer complementary to a nucleotide sequence present upstream or downstream of all of the 
overexpressed genes (such as a primer complementary to a nucleotide sequence in a replacement 
promoter upstream of all of the overexpressed genes) and a set of primers complementary to a 
nucleotide sequence unique to each of the strains (such as a primer complementary to a nucleotide 
sequence within each of the proliferation-required genes). One of the two amplification primers for 
each of the proliferation-required genes is labeled with a dye as described above. Preferably, the 
common primer complementary to a nucleotide sequence upstream or downstream of all of the 
overexpressed genes is labeled with the dye. The primers used in the amplification reaction are 
designed so that the amplification product corresponding to each proliferation-required gene has a 
unique length or a dye which allows it to be distinguished from other amplification products of the 
same length. A second amplification reaction is conducted on a control culture or collection of 
strains which was not contacted with the compound using the same primers as in the first 
amplification reaction. The amplification products from the first amplification reaction are 
compared to those from the second amplification reaction to identify one or more amplification 
products which are overrepresented in the culture or collection of strains. For example, the 
amplification products from the first amplification reaction may be run in a separate lane of a 
polyacrylamide gel or a separate capillary from the amplification products from the second 
amplification reaction and the two lanes or capillaries are compared to one another. If desired, in 
the embodiment where the amplification products from the first amplification reaction are run in a 
different lane or capillary than the amplification products from the second amplification reaction, 
the same dye may be used to label the primers in the first and second amplification reactions. 
Alternatively, if desired, different dyes may be used to label the primers in the first and second 
amplification reactions. 

Alternatively, in some embodiments, the primers in the second amplification reaction are 
labeled with a different dye which is distinguishable from the dye used in the first amplification 
reaction. In this embodiment, the amplification reactions may be pooled and run in the same lane 
on a polyacrylamide gel or in the same capillary and the products from each amplification reaction 
are compared by comparing the amount of each dye present for each amplification product. Figures 
3A and 3B illustrate one embodiment of this method in which the absence of an amplification 
product from the amplification reaction performed on a culture comprising a plurality of strains 
underexpressing genes required for proliferation which was contacted with the compound indicates 
that a test compound acts on the gene corresponding to the missing amplification product. It will be 
appreciated that the method may also be used to identify an amplification product which is 
overrepresented in an amplification reaction conducted on a culture or collection of strains 
overexpressing genes required for proliferation because the test compound acted on the 
corresponding gene. 
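
The comparison of the amount of each dye for each amplification product in a pooled run may be illustrated by the following Python sketch. It assumes that the reaction from the contacted culture is labeled with one dye and the control reaction with another, and it expresses the comparison as a log2 ratio per product length. The function name `dye_log_ratios`, the detection floor, the cutoff of two log2 units, and the signal values are all hypothetical.

```python
import math


def dye_log_ratios(peaks, treated_dye, control_dye, floor=1.0):
    """peaks maps amplicon length -> {dye: signal} for a pooled run.

    Returns {length: log2(treated / control)}. `floor` is an assumed
    detection floor that stands in for an absent peak so that the ratio
    stays finite.
    """
    ratios = {}
    for length, by_dye in peaks.items():
        treated = max(by_dye.get(treated_dye, 0.0), floor)
        control = max(by_dye.get(control_dye, 0.0), floor)
        ratios[length] = math.log2(treated / control)
    return ratios


# Hypothetical pooled run: 6FAM labels the contacted culture, HEX the control.
peaks = {
    150: {"6FAM": 400.0, "HEX": 410.0},
    162: {"HEX": 390.0},                  # product missing from treated culture
    174: {"6FAM": 1600.0, "HEX": 400.0},
}
ratios = dye_log_ratios(peaks, treated_dye="6FAM", control_dye="HEX")
depleted = [length for length, r in ratios.items() if r <= -2.0]
enriched = [length for length, r in ratios.items() if r >= 2.0]
```

A strongly negative ratio corresponds to the missing product of the underexpression format, and a strongly positive ratio corresponds to the overrepresented product of the overexpression format.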

If desired, rather than dividing the culture into aliquots, individual amplification reactions 
may be conducted on nucleic acids obtained from the culture or collection of strains. Each 
amplification reaction contains primers which will yield an amplification product specific for only 
one of the proliferation-required genes. The resulting amplification products from each of the 
individual amplification reactions are pooled and amplification products having increased 
representation in the culture are identified as described above. 

In another embodiment, a culture or collection of strains in which gene products required 
for proliferation are overexpressed from regulatable promoters which replaced the native promoters 
of the genes encoding these gene products is allowed to grow in the presence of a test compound for 
a desired number of generations. Preferably, the culture or collection of strains is allowed to grow 
in the presence of the test compound for at least 20 generations. Nucleic acids are isolated from the 
culture or collection of strains and an amplification reaction is performed using a primer which is 
complementary to a nucleotide sequence within the replacement promoter(s) or a nucleotide 
sequence adjacent to the 5' end thereof and primers which are complementary to a nucleotide 
sequence within the proliferation-required genes or nucleotide sequences adjacent thereto. The 
resulting amplification product(s) is directly sequenced using a primer complementary to a 
nucleotide sequence within the replacement promoter. 

In one embodiment of the present invention, the vector containing the nucleotide sequence 
encoding the proliferation-required gene product is obtained from a strain which proliferated more 
rapidly in the culture using methods such as plasmid preparation techniques. Nucleic acid 
sequencing techniques are then employed to determine the nucleotide sequence of the gene which 
was overexpressed. 

Alternatively, the identity of the overexpressed gene product which is the target of the 
compound may be determined by performing a nucleic acid amplification reaction, such as a 
polymerase chain reaction (PCR), to identify the nucleotide sequence of the gene which was 
overexpressed. For example, aliquots of a nucleic acid preparation, such as a purified plasmid, 
from the strain which is recovered from the culture may each be contacted with a pair of PCR 
primers which would amplify a different proliferation-required gene to determine which pair of 
primers yields an amplification product. 

An alternative method for determining the identity of the gene product described herein 
which is the target of the compound involves obtaining a nucleic acid array, such as a DNA chip, 
which contains each of the proliferation-required genes which were overexpressed in the strains in 
the culture. Each proliferation-required gene occupies a known location in the array. A nucleic 
acid preparation, such as a plasmid preparation, from the recovered strain is labeled with a 
detectable agent, such as a radioactive or fluorescent moiety, and placed in contact with the nucleic 
acid array under conditions which permit the labeled nucleic acid to hybridize to complementary 
nucleic acids on the array. The location on the array to which the labeled nucleic acids hybridize is 
determined to identify the gene which was overexpressed in the recovered strain. If desired, the 
hybridized nucleic acids from a culture which was contacted with the compound may be compared 
to the hybridized nucleic acids from a control culture which was not contacted with the compound. 
Alternatively, the hybridized nucleic acids from a culture which was contacted with the compound 
may be directly identified without comparison to nucleic acids from a control culture. 
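
Because each proliferation-required gene occupies a known location in the array, reading out the result amounts to a lookup from hybridized locations to genes. A minimal Python sketch follows, assuming a rectangular array addressed by (row, column), a single background intensity, and an illustrative signal-to-background cutoff; the function `genes_at_hybridized_spots`, the layout, and the intensities are hypothetical.

```python
def genes_at_hybridized_spots(layout, signals, background, min_ratio=3.0):
    """layout maps (row, col) -> gene; signals maps (row, col) -> intensity.

    A spot counts as hybridized when its intensity is at least `min_ratio`
    times the background (an assumed, illustrative cutoff). Genes are
    returned with the strongest signal first.
    """
    if background <= 0:
        raise ValueError("background must be positive")
    hits = []
    for position, gene in layout.items():
        intensity = signals.get(position, 0.0)
        if intensity / background >= min_ratio:
            hits.append((intensity, gene))
    return [gene for _, gene in sorted(hits, reverse=True)]


# Hypothetical 2 x 2 array and intensities (arbitrary units).
layout = {(0, 0): "gene_A", (0, 1): "gene_B",
          (1, 0): "gene_C", (1, 1): "gene_D"}
signals = {(0, 0): 120.0, (0, 1): 95.0, (1, 0): 4100.0, (1, 1): 110.0}
identified = genes_at_hybridized_spots(layout, signals, background=100.0)
```

Here only the spot at row 1, column 0 exceeds the cutoff, so the labeled preparation from the recovered strain would be attributed to the gene deposited at that location.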
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In some instances, more than one strain may proliferate more rapidly in the presence of the 
compound. This may result from a variety of causes. For example, the concentration of the 
compound may not have heen high enough to restrict proliferation only to cells which overexpress 
one gene product (i.e. the target gene product). While strains which overexpress the target gene 

5 product will be the most prevalent strain in the culture, other strains may also have proliferated. In 
such instances, the identity of the gene product in the strain which is most prevalent in the culture 
may be identified by quantitating the levels of each of the genes encoding proliferation-required 
proteins in the culture. This may be accomplished by quantitative PGR, DNA sequencing, 
hybridization, or array technology as described above. 
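
The quantitation step can be illustrated with a short Python sketch. It is a minimal example under
stated assumptions, not a prescribed implementation: the gene names, the counts, and the
pseudocount are hypothetical, and the counts could come from any of the quantitation methods
named above. Each gene's share of the signal in the compound-treated culture is compared with its
share in an untreated control, and genes are ranked by log2 enrichment.

```python
from math import log2


def rank_enriched_strains(treated, control, pseudocount=1.0):
    """Rank genes by log2 enrichment of their share of signal, treated vs control.

    treated, control: dicts mapping a gene name to a measured level (hypothetical units).
    The pseudocount (an assumption) avoids division by zero for genes with no signal.
    """
    n = len(treated)
    treated_total = sum(treated.values()) + pseudocount * n
    control_total = sum(control.get(g, 0.0) for g in treated) + pseudocount * n
    scores = {}
    for gene, level in treated.items():
        treated_share = (level + pseudocount) / treated_total
        control_share = (control.get(gene, 0.0) + pseudocount) / control_total
        scores[gene] = log2(treated_share / control_share)
    # Most enriched gene first: the candidate target in an overexpression assay.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical levels for three strains.
treated = {"geneA": 900, "geneB": 60, "geneC": 40}
control = {"geneA": 330, "geneB": 340, "geneC": 330}
ranking = rank_enriched_strains(treated, control)
print(ranking[0][0])
```

In an assay where the strains underexpress the gene products, the same ranking read from the
bottom would point to the least prevalent strain.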

In other instances, multiple strains will exhibit more rapid proliferation in the culture as a
result of a common functional attribute. For example, the strains which proliferate more rapidly
may each overexpress a gene product with a common enzymatic activity, such as serine protease
activity. Alternatively, the strains which proliferate more rapidly may each
overexpress a gene product with a common functional domain, such as a cAMP binding domain. In
such instances, the common attribute of the strains which proliferate more rapidly may provide
information as to the mode of action of the compound or the biochemical activity of the target of
the compound. For example, if all of the overexpressed genes in the strains which proliferated
more rapidly encode serine proteases, the compound acts by inhibiting serine protease activity and the
target protein is a serine protease. If desired, the compound may be derivatized and the efficacy of
the derivatized compound against each of the strains which proliferated more rapidly may be
assessed as described herein in order to identify derivatives which are capable of interacting with a
wide range of targets sharing a common activity or binding site (i.e. derivatives which have a
greater ability to inhibit the proliferation of all the strains than the original compound) or to identify
derivatives having greater specificity for a desired target (i.e. derivatives which have a greater
specificity for one of the strains than the original compound). For example, it is possible that a
nonessential gene product expressed in the cell might also bind to the initial test compound in
addition to the gene product required for proliferation. In such an instance, it is desirable to obtain
a derivative of the initial test compound which is specific for the gene product required for
proliferation. In addition, it is possible that two gene products required for proliferation might bind
to the initial test compound but specificity for one of the gene products is desired.

Rather than employing a single culture which contains multiple strains each of which
overexpresses a proliferation-required gene product described herein, the methods of the present
invention may be performed using an array of individual strains (i.e. a collection of strains) each of
which overexpresses a different proliferation-required gene product. For example, individual
strains each overexpressing a different proliferation-required gene product may be grown in
different wells of a multiwell plate. Each well is contacted with the compound (and, where
appropriate, an agent which regulates the level of expression from the promoter). The level of
proliferation of the strains in each of the wells is determined to identify a strain which proliferated
more rapidly. The identity of the overexpressed gene product in the strain that proliferated more
rapidly is determined as described above.

In another embodiment, individual strains each overexpressing a different proliferation-required
gene product (i.e. a collection of strains) are grown at different locations on a solid
medium, such as an agar plate. The medium contains the compound (and, where appropriate, an
agent which regulates the level of expression from the promoter). The level of proliferation of each
of the strains is determined to identify a strain which proliferated more rapidly. The identity of the
overexpressed gene product in the strain that proliferated more rapidly is determined as described
above.

The above methods may be used to prioritize compound development or to determine
whether the compound has been previously identified or whether the target of the compound is the
target of a previously identified drug. In particular, if the product is a natural product, it is
advantageous to determine whether it has been previously identified prior to investing significant
effort in developing it. Thus, in some embodiments of the present invention, the target of a partially
purified or purified natural product or a compound produced by combinatorial chemistry is
identified using the methods described above and compared to the targets of known drugs. If the
target is identical to that of a known drug, further development of the compound is halted.

Alternatively, an array of strains each of which overexpresses a different gene product
described herein (i.e. a collection of strains) is grown on solid medium containing a compound to be
evaluated. The location of each strain in the array and the gene product overexpressed by that strain
is known. The pattern of colonies which grow in the presence of the compound is evaluated and
compared to the pattern of colonies which grow in the presence of previously identified drugs. If
the pattern of colonies which grow in the presence of the compound being evaluated is the same as
the pattern of colonies which grow in the presence of a previously identified drug, further
development of the compound is halted.
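
The pattern comparison described above can be sketched in Python as follows. This is an
illustrative sketch only: the array positions, drug names, and the choice to represent a growth
pattern as the set of positions at which colonies grew are assumptions made for the example.

```python
def matching_known_drugs(observed_pattern, known_patterns):
    """Return the names of known drugs whose growth pattern equals the observed one.

    observed_pattern: iterable of array positions where colonies grew (hypothetical labels).
    known_patterns: dict mapping a drug name to its set of positions with growth.
    """
    observed = frozenset(observed_pattern)
    return sorted(
        name for name, pattern in known_patterns.items()
        if frozenset(pattern) == observed
    )


# Hypothetical reference patterns recorded for previously identified drugs.
known_patterns = {
    "known_drug_1": {"A1", "B3"},
    "known_drug_2": {"C2"},
}

# A test compound whose colonies grow only at A1 and B3 matches known_drug_1,
# so further development would be halted under the scheme described above.
print(matching_known_drugs({"B3", "A1"}, known_patterns))
```

Exact set equality is the simplest reading of "the same pattern"; a practical screen might
instead tolerate small differences, which is not addressed here.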

Additionally, in some embodiments, the sequence of the gene product in a strain which
proliferated more rapidly in the assays described above is compared to the sequence of gene
products from heterologous organisms to determine the likely spectrum of species whose growth
would be inhibited by the compound. If the gene product has a high degree of homology to gene
products from heterologous species, it is likely that the compound would also inhibit the growth of
these heterologous species. Homology may be determined using any of a variety of methods
familiar to those skilled in the art. For example, homology may be determined using a computer
program such as BLASTP or FASTA. The ability of the compound to inhibit the growth of the
heterologous species may then be confirmed by comparing the growth of cells of the heterologous
species in the presence and absence of the compound.
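
As a simple illustration of a homology measure, the Python sketch below computes percent
identity between two sequences that have already been aligned. It is a stand-in for illustration
and is not BLASTP or FASTA: those programs perform the alignment and scoring themselves. The
example sequences and the use of "-" as the gap character are assumptions.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over aligned positions where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for residue_a, residue_b in zip(aligned_a, aligned_b):
        if residue_a == gap or residue_b == gap:
            continue  # gapped columns are skipped in this simple measure
        compared += 1
        if residue_a == residue_b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Hypothetical aligned fragments of a target protein and a heterologous protein.
print(percent_identity("MKTAIV", "MKTALV"))
```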

Current methods for identifying the target of compounds which inhibit cellular proliferation
are laborious and time consuming. The above methods may be employed to allow the targets of a
large number of compounds to be rapidly identified. In such methods, the methods described above
are simultaneously performed for each of a large number of compounds. For example, the
compounds may be members of a library of compounds generated using combinatorial chemistry or
members of a natural product library. In such methods, a plurality of cultures each comprising a
plurality of strains each of which overexpresses a different gene product required for proliferation
or a plurality of collections of individual strains each of which overexpresses a different gene
product required for proliferation is obtained. Each culture or collection of strains is contacted with
a different compound in the library and the target of the compound is identified as described above.

In another embodiment, the gene product described herein on which a compound which
inhibits the proliferation of an organism acts is identified using a culture which comprises a mixture
of strains of the organism including strains which underexpress a different gene product which is
required for proliferation of the organism (i.e. at least some of the strains in the culture
underexpress a gene product which is required for proliferation of the organism). Preferably, each
of the strains in the culture underexpresses a different gene product which is required for the
proliferation of the organism (i.e. all of the strains in the culture underexpress a gene product which
is required for the proliferation of the organism). In some embodiments, the culture comprises at
least one strain which underexpresses a gene product selected from the group consisting of a gene
product whose activity or level is inhibited by a nucleic acid comprising a nucleotide sequence
selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 6214-42397,
a gene product comprising an amino acid sequence selected from the group consisting of
SEQ ID NOs.: 42398-78581, a gene product whose activity or level is inhibited by a homologous
antisense nucleic acid, a gene product encoded by a homologous coding nucleic acid, and a gene
product comprising a homologous polypeptide.

Strains underexpressing the proliferation-required gene products described herein may be
obtained using the methods described above. The culture may comprise any number of strains. For
example, the culture may comprise at least two strains, at least 10 strains, at least 20 strains, at least
30 strains, at least 50 strains, at least 100 strains, at least 300 strains or more than 300 strains which
underexpress a gene product required for proliferation. In some embodiments, the strains in the
culture in aggregate may underexpress all or most of the gene products required for proliferation of
the organism.

The culture is contacted with a compound which inhibits proliferation of the organism. The
compound may be a candidate drug compound obtained from any source. For example, the
compound may be a compound generated using combinatorial chemistry, a compound from a
natural product library, or an impure or partially purified compound, such as a compound in a
partially purified natural extract. The culture is contacted with a sufficient concentration of the
compound to inhibit the proliferation of strains of the organism in the culture which underexpress
the gene product on which the compound acts, such that strains which do not underexpress the gene
product on which the compound acts proliferate more rapidly in the culture than strains which do
underexpress said gene product on which said compound acts. Thus, after a sufficient period of
time, the strain which underexpresses the gene product on which the compound acts will be less
prevalent in the culture than strains which do not underexpress the gene product on which the
compound acts. In one embodiment, the growth conditions and incubation period are selected so
that only one strain, the strain underexpressing the target of the compound, proliferates at a reduced
rate in the culture. In another embodiment, the growth conditions may be selected so that the strain
underexpressing the target of the compound is not recovered from the culture. Thus, in one
embodiment, a plurality of cultures containing a plurality of strains each of which underexpresses a
different proliferation-required gene product may be grown in the presence of varying
concentrations of the compound. In addition to varying the compound concentrations, in
embodiments where expression of the proliferation-required gene product is under the control of a
regulatable promoter, the plurality of cultures may be grown at varying concentrations of an agent
which regulates the level of expression from the promoter, such as an inducer or an agent which
reduces the effect of a repressor on transcription from the promoter. It will be appreciated that the
cultures may be grown in liquid medium in the presence of the compound whose target is to be
identified (and, where appropriate, in the presence of an agent which regulates the level of expression
from the promoter) or alternatively, a liquid culture comprising the strains which underexpress the
proliferation-required gene products may be grown in the absence of the compound whose target is
to be identified and then introduced onto a solid medium containing the compound (and, where
appropriate, also containing an agent which regulates the level of expression from the promoter).

The identity of the underexpressed gene product which is the target of the compound may
be determined using a variety of methods. For example, in some embodiments of the present
invention, the nucleic acids present in the culture or collection of strains which was contacted with
the compound may be compared to the nucleic acids present in a control culture or collection of
strains which was not contacted with the compound to identify nucleic acids which are
underrepresented in the culture or collection of strains contacted with the test compound relative to
the control culture or strains. Alternatively, in some embodiments, the nucleic acids present in a
culture or collection of strains contacted with the test compound may be analyzed to identify those
nucleic acids which are missing or present at reduced levels without comparison to a control culture
or collection of strains.

In some embodiments of the present invention, the strains which proliferated more slowly
in the culture or collection of strains, i.e. strains having a decreased ability to proliferate in the
presence of a test compound or which do not proliferate in the presence of a test compound, are
identified as follows. Amplification products which are correlated with each of the underexpressed
genes and which are distinguishable from one another are obtained from a culture or collection
grown in the presence of a test compound. The amplification products are distinguished from one
another to determine whether a particular amplification product is underrepresented in the culture or
collection of strains. In some embodiments, the amplification products corresponding to each of the
gene products have lengths which permit them to be distinguished from one another. In another
embodiment, one or more of the amplification products have similar or identical lengths but are
distinguishable from one another based on a detectable agent, such as a dye, attached thereto. In
some embodiments, amplification products which are underrepresented are identified by comparing
the amplification products from the culture or collection of strains which was contacted with the
test compound to the amplification products from a culture or collection of strains which was not
contacted with the test compound. Alternatively, amplification products which are
underrepresented in the culture or collection of strains may be identified simply by determining
which amplification products are missing or present at reduced levels in the culture or collection of
strains. The above methods for generating distinguishable amplification products may be used in
conjunction with any of the methods for generating strains which underexpress gene products
required for proliferation described herein in order to facilitate the identification of strains which
proliferate more slowly in the presence of a test compound.

For example, in some embodiments of the present invention, the native promoter of each
of the genes encoding a gene product required for proliferation is replaced by a single desired
replacement promoter. After growth of the culture or collection of strains containing the strains in
which the promoters have been replaced in the presence of a test compound for a desired period of
time, an amplification reaction is performed on nucleic acids obtained from the culture as follows.

The nucleic acids from the culture or collection of strains are divided into at least two
aliquots. In a preferred embodiment the nucleic acids from the culture or collection of strains are
divided into four aliquots. A single primer complementary to a nucleotide sequence within the
replacement promoter, within the proliferation-required genes, or within nucleic acid sequences
adjacent to the promoter or proliferation-required genes is divided into four groups. Each group is
labeled with a distinct detectable dye, such as the 6FAM™, TET™, VIC™, HEX™, NED™, and
PET™ dyes obtainable from Applied Biosystems (Foster City, CA). For example, the DS-31 or
DS-33 dye sets available from Applied Biosystems (Foster City, CA) may be used to label the
primers. Each of the groups of labeled primers is added to one of the aliquots of the nucleic acids
from the culture or collection of strains such that each aliquot of nucleic acid receives a single
labeled primer with a single detectable dye thereon.

Each of the aliquots of nucleic acids also receives a set of unlabeled primers, with each of
the unlabeled primers being complementary to a nucleotide sequence within the promoter, within a
nucleotide sequence which is unique to one of the genes encoding gene products required for
proliferation which were placed under the control of the replacement promoter, or within nucleotide
sequences adjacent to the promoter or proliferation-required genes. Each of the aliquots receives
primers unique to 1/N of the proliferation-required genes which were placed under the control of the
replacement promoter, where N is the number of aliquots (i.e. if the culture or collection of strains
consisted of 100 strains in which a gene required for proliferation was placed under the control of
the replacement promoter and was divided into four aliquots, then each of the four aliquots of
nucleic acids from the culture or collection of strains would receive primers complementary to 25
of the genes). The unlabeled primers are selected so that each will yield an amplification product
having a length distinguishable from the length of the amplification product produced with the other
unlabeled primers. Preferably, the amplification products are between about 100 and about 400
nucleotides in length, but any lengths which may be distinguished from each other may be used. In
addition, in some of the embodiments some of the amplification products may have identical or
very similar lengths but be distinguishable from one another due to labeling with distinguishable
dyes.
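
The division of genes among aliquots can be sketched in Python as follows. This is a minimal
illustration under stated assumptions, not the procedure itself: the gene names are
placeholders, the even spacing of product lengths within the 100 to 400 nucleotide window is an
assumed design choice, and whether a given spacing can actually be resolved depends on the
separation method used. Each gene is assigned a dye (one per aliquot) and a target product length
so that every (dye, length) pair in the panel is unique.

```python
def design_primer_panel(genes, dyes, min_len=100, max_len=400):
    """Assign each gene a (dye, product length) pair that is unique within the panel.

    genes: list of gene names; dyes: one dye label per aliquot.
    Genes are dealt out to aliquots in turn, so each aliquot receives about 1/N of them.
    """
    n_aliquots = len(dyes)
    panel = {}
    for i, dye in enumerate(dyes):
        group = genes[i::n_aliquots]
        if not group:
            continue
        # Spread the product lengths evenly across the allowed window (assumption).
        step = (max_len - min_len) // max(len(group) - 1, 1)
        if step < 1:
            raise ValueError("too many genes per aliquot for the length window")
        for j, gene in enumerate(group):
            panel[gene] = (dye, min_len + j * step)
    return panel


# 100 hypothetical genes and four dyes: 25 genes per aliquot, products 12 nt apart.
genes = [f"gene_{k:03d}" for k in range(100)]
panel = design_primer_panel(genes, ["6FAM", "VIC", "NED", "PET"])
print(panel["gene_000"], panel["gene_004"])
```

Because the dye distinguishes the aliquots, the same lengths can be reused across aliquots, which
is what allows the products to be pooled before separation.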

A nucleic acid amplification reaction is conducted on each of the nucleic acid aliquots. The
amplification products are then separated by length to identify amplification products having decreased
representation or which are absent in the culture or collection of strains. The amplification products
are then correlated with the corresponding genes to determine which strains proliferated more
slowly in the culture or collection of strains. If desired, amplification products having decreased
representation in the culture may be identified by comparing the amplification products obtained
from a culture or collection of strains which was contacted with the compound to amplification
products obtained from a control culture or collection of strains which was not contacted with the
compound. Alternatively, if desired, the amplification products which are missing or present at
reduced levels in a culture which was contacted with the compound may be directly identified
without comparison to a control culture which was not contacted with the compound.

For example, in some embodiments, the amplification products from each of the nucleic
acid aliquots are pooled and subjected to capillary electrophoresis. The amplification products are
detected by detecting the fluorescent dyes attached thereto and their lengths are determined to
identify those amplification products having decreased representation in the culture or collection of
strains. Figures 2A and 2B illustrate one embodiment of this method in which the absence of an
amplification product from an amplification reaction performed on a culture comprising a plurality
of strains underexpressing genes required for proliferation indicates that a test compound acts on
the gene corresponding to the missing amplification product.
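
Correlating detected products with genes can be sketched in Python as follows. The sketch is
illustrative only: the panel, the peak list, and the length tolerance of two nucleotides are
assumptions, and real electrophoresis output would need peak calling and sizing before this step.
A gene is reported as missing when no detected peak carries its dye at approximately its
expected length.

```python
def missing_products(panel, peaks, tolerance=2):
    """Return genes whose expected amplification product was not detected.

    panel: dict mapping gene -> (dye, expected length in nucleotides).
    peaks: iterable of (dye, observed length) pairs from the pooled separation.
    tolerance: allowed sizing error in nucleotides (an assumed value).
    """
    peaks = list(peaks)
    missing = []
    for gene, (dye, expected) in panel.items():
        detected = any(
            peak_dye == dye and abs(length - expected) <= tolerance
            for peak_dye, length in peaks
        )
        if not detected:
            missing.append(gene)
    return missing


# Hypothetical three-gene panel and the peaks observed after compound treatment.
panel = {"gene1": ("6FAM", 100), "gene2": ("6FAM", 150), "gene3": ("VIC", 100)}
peaks = [("6FAM", 101), ("VIC", 99)]
print(missing_products(panel, peaks))
```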

Alternatively, in another embodiment, a first amplification reaction is performed on nucleic
acids obtained from a culture or collection of strains which was contacted with the compound using
a first primer complementary to a nucleotide sequence present upstream or downstream of all of the
overexpressed genes (such as a primer complementary to a nucleotide sequence in a replacement
promoter upstream of all of the overexpressed genes) and a set of primers complementary to a
nucleotide sequence unique to each of the strains (such as a primer complementary to a nucleotide
sequence within each of the proliferation-required genes). One of the two amplification primers for
each of the proliferation-required genes is labeled with a dye as described above. Preferably, the
common primer complementary to a nucleotide sequence upstream or downstream of all of the
overexpressed genes is labeled with the dye. The primers used in the amplification reaction are
designed so that the amplification product corresponding to each proliferation-required gene has a
unique length. A second amplification reaction is conducted on a control culture or collection of
strains which was not contacted with the compound using the same primers as in the first
amplification reaction. The amplification products from the first amplification reaction are
compared to those from the second amplification reaction to identify one or more amplification
products which are underrepresented in the culture or collection of strains. For example, the
amplification products from the first amplification reaction may be run in a separate lane of a
polyacrylamide gel or a separate capillary than the amplification products from the second
amplification reaction and the two lanes or capillaries are compared to one another.

Alternatively, in some embodiments, the primers in the second amplification reaction are
labeled with a different dye which is distinguishable from the dye used in the first amplification
reaction. In this embodiment, the amplification reactions may be pooled and run in the same lane
on a polyacrylamide gel or in the same capillary and the products from each amplification reaction
are compared by comparing the amount of each dye present for each amplification product. Figures
3A and 3B illustrate one embodiment of this method in which the absence of an amplification
product from the amplification reaction performed on a culture comprising a plurality of strains
underexpressing genes required for proliferation which was contacted with the compound indicates
that a test compound acts on the gene corresponding to the missing amplification product.
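
The two-dye comparison can be sketched in Python as follows. This is a hedged illustration: the
signal values, gene names, and the ratio threshold of 0.5 are assumptions chosen for the example,
and no normalization between the two dyes is attempted. For each amplification product, the
signal from the dye used on the compound-treated culture is divided by the signal from the dye
used on the control culture.

```python
def underrepresented_by_dye_ratio(treated_signal, control_signal, threshold=0.5):
    """Flag products whose treated/control dye signal ratio falls below a threshold.

    treated_signal, control_signal: dicts mapping a product (here keyed by gene) to the
    signal measured for the treated-culture dye and the control-culture dye respectively.
    threshold: ratio below which a product counts as underrepresented (assumed value).
    """
    flagged = {}
    for product, control in control_signal.items():
        if control <= 0:
            continue  # no control signal, so no ratio can be formed
        ratio = treated_signal.get(product, 0.0) / control
        if ratio < threshold:
            flagged[product] = ratio
    return flagged


# Hypothetical peak signals for three products run in the same capillary.
treated = {"gene1": 980.0, "gene2": 0.0, "gene3": 1010.0}
control = {"gene1": 1000.0, "gene2": 950.0, "gene3": 1000.0}
print(underrepresented_by_dye_ratio(treated, control))
```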

If desired, rather than dividing the culture into aliquots, individual amplification reactions
may be conducted on nucleic acids obtained from the culture or collection of strains. Each
amplification reaction contains primers which will yield an amplification product specific for only
one of the proliferation-required genes. The resulting amplification products from each of the
individual amplification reactions are pooled and amplification products having decreased
representation in the culture are identified as described above.

In an alternative embodiment, the representation of each strain in the culture may be
assessed by hybridizing detectably labeled nucleic acids encoding the proliferation-required gene
products, or portions thereof, obtained from the culture to an array comprising nucleic acids
encoding the gene products required for proliferation or portions thereof. Each nucleic acid
encoding a gene product required for proliferation or portion thereof occupies a known location on
the array. The signal from each location on the array is quantitated to identify those nucleic acids
encoding a proliferation-required gene product which are underrepresented in the culture. If desired,
the hybridized nucleic acids from a culture which was contacted with the compound may be
compared to the hybridized nucleic acids from a control culture which was not contacted with the
compound. Alternatively, the hybridized nucleic acids from a culture which was contacted with the
compound may be directly analyzed without comparison to nucleic acids from a control culture.

In another alternative, each strain underexpressing a gene product required for proliferation
may be constructed to contain a unique nucleic acid sequence (referred to herein as a "tag"). The
tag may be included in the chromosome of each strain or in an extrachromosomal vector. For
example, the tag could be included in a vector encoding an antisense nucleic acid complementary to
a gene encoding a gene product required for proliferation or a portion of such a gene or the tag may
be included in the antisense nucleic acid itself. The representation of each strain in the culture may
be assessed by performing an amplification reaction using primers complementary to each of the
tags and quantitating the levels of the resulting amplification products to identify a tag which is
underrepresented or absent from the culture. Since each tag corresponds to one strain, the strain
which is underrepresented or absent from the culture may be identified. If desired, the tags present
in a culture which was contacted with the compound may be compared to the tags present in a
control culture which was not contacted with the compound. Alternatively, the tags present in a
culture which was contacted with the compound may be analyzed without comparison to a control
culture.

It will be appreciated that, if desired, unique tags may also be used in embodiments in
which gene products required for proliferation are overexpressed. In some aspects of such
embodiments, the tags may be within or adjacent to the promoter which drives expression of the
gene encoding the gene product. In such embodiments, the gene product which is overexpressed in
strains which proliferate more rapidly in the culture may be identified by detecting the presence or
amount of the unique tag corresponding to that gene product in the culture.

In some instances, more than one strain may proliferate less rapidly in the presence of the
compound. This may result from a variety of causes. For example, the concentration of the
compound may not have been high enough to reduce the proliferation only in cells which
underexpress one gene product (i.e. the target gene product). While strains which underexpress the
target gene product will be the least prevalent strain in the culture, other strains may also be
underrepresented. In such instances, the identity of the gene product in the strain which is least
prevalent in the culture (or not recovered from the culture) may be identified by quantitating the
levels of each of the genes encoding proliferation-required proteins in the culture. This may be
accomplished by quantitative PCR, DNA sequencing, hybridization, or array technology as
described above.

In other instances, multiple strains will exhibit less rapid proliferation in the culture as a
result of a common functional attribute. For example, the strains which proliferate less rapidly (or
the strains which are not recovered from the culture) may each underexpress a gene product with a
common enzymatic activity, such as serine protease activity. Alternatively, the strains
which proliferate less rapidly (or the strains which are not recovered from the culture) may each
underexpress a gene product with a common functional domain, such as a cAMP binding domain.
In such instances, the common attribute of the strains which proliferate less rapidly (or the strains
which are not recovered from the culture) may provide information as to the mode of action of the
compound or the biochemical activity of the target of the compound. For example, if all of the
underexpressed genes in the strains which proliferated less rapidly encode serine proteases, the
compound acts by inhibiting serine protease activity and the target protein is a serine protease. If
desired, the compound may be derivatized and the efficacy of the derivatized compound against
each of the strains which proliferated less rapidly may be assessed as described herein in order to
identify derivatives which are capable of interacting with a wide range of targets sharing a common
activity or binding site (i.e. derivatives which have a greater ability to inhibit the proliferation of all
the strains than the original compound) or to identify derivatives having greater specificity for a
desired target (i.e. derivatives which have a greater specificity for one of the strains than the
original compound).
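
The search for a common functional attribute among the affected strains can be sketched in Python
as follows. The sketch is illustrative only: the gene names and annotation labels are
hypothetical, and the annotations are assumed to be available from some prior characterization of
each gene product.

```python
def shared_attributes(affected_genes, annotations):
    """Return the annotations common to every affected gene.

    affected_genes: genes whose strains proliferated less (or more) rapidly.
    annotations: dict mapping gene -> collection of functional labels (hypothetical).
    """
    label_sets = [set(annotations[gene]) for gene in affected_genes]
    if not label_sets:
        return set()
    return set.intersection(*label_sets)


# Hypothetical annotations for three proliferation-required gene products.
annotations = {
    "geneA": {"serine protease", "secreted"},
    "geneB": {"serine protease"},
    "geneC": {"cAMP binding domain"},
}

# geneA and geneB share serine protease activity, which would suggest that the
# compound acts on that activity; adding geneC leaves no shared attribute.
print(shared_attributes(["geneA", "geneB"], annotations))
print(shared_attributes(["geneA", "geneB", "geneC"], annotations))
```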

Rather than employing a single culture which contains multiple strains each of which
underexpresses a proliferation-required gene product described herein, the methods of the present
invention may be performed using an array of individual strains (i.e. a collection of strains) each of
which underexpresses a different proliferation-required gene product. For example, individual
strains each underexpressing a different proliferation-required gene product may be grown in
different wells of a multiwell plate. Each well is contacted with the compound (and, where
appropriate, an agent which regulates the level of expression from the promoter). The level of
proliferation of the strains in each of the wells is determined to identify a strain which proliferated
less rapidly or which did not proliferate at all. The identity of the underexpressed gene product in
the strain that proliferated less rapidly or which did not proliferate at all is determined as described
above.

In another embodiment, individual strains each underexpressing a different proliferation-required
gene product (i.e. a collection of strains) are grown at different locations on a solid
medium, such as an agar plate. The medium contains the compound and, where appropriate, an
agent which regulates the level of expression from the promoter. The level of proliferation of each
of the strains is determined to identify a strain which proliferated less rapidly (or a strain which is
not recovered from the culture). The identity of the underexpressed gene product in the strain that
proliferated less rapidly (or the strain which is not recovered from the culture) is determined as
described above.

The above methods may be used to prioritize compound development or to determine
whether the compound has been previously identified or whether the target of the compound is the
target of a previously identified drug. In particular, if the product is a natural product, it is
advantageous to determine whether it has been previously identified prior to investing significant
effort in developing it. Thus, in some embodiments of the present invention, the target of a partially
purified or purified natural product or a compound produced by combinatorial chemistry is
identified using the methods described above and compared to the targets of known drugs. If the
target is identical to that of a known drug, further development of the compound is halted.

Alternatively, an array of strains each of which underexpresses a different gene product 

35 described herein (i.e. a collection of strains) is grown on solid medium containing a compound to be 
evaluated. The location of each strain in the array and the gene product underexpressed by that 
strain is known. The pattern of colonies which grow less rapidly or fail to grow in the presence of 
the compound is evaluated and compared to the pattern of colonies which grow less rapidly or fail 
to grow in the presence of previously identified drugs. If the pattern of colonies which grow less
rapidly or fail to grow in the presence of the compound being evaluated is the same as the pattern of 
colonies which grow less rapidly or fail to grow in the presence of a previously identified drug, 
further development of the compound is halted. 
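
The pattern comparison described above can be sketched as follows. This is a minimal illustration only, assuming that growth at each array position is recorded as a fraction of growth in the absence of compound; the function names, the 0.5 threshold and the example data are hypothetical and are not taken from the present disclosure.

```python
def sensitive_positions(relative_growth, threshold=0.5):
    """Return the array positions whose growth, relative to a
    no-compound control, falls below the (assumed) threshold."""
    return {pos for pos, value in relative_growth.items() if value < threshold}


def matching_known_drugs(compound_growth, known_profiles, threshold=0.5):
    """List previously identified drugs whose pattern of slow-growing or
    non-growing colonies is identical to that of the test compound."""
    pattern = sensitive_positions(compound_growth, threshold)
    return sorted(
        name
        for name, profile in known_profiles.items()
        if sensitive_positions(profile, threshold) == pattern
    )


# Hypothetical data: array position -> growth relative to the untreated array.
test_compound = {"A1": 0.1, "A2": 1.0, "A3": 0.9}
known_drugs = {
    "drugX": {"A1": 0.2, "A2": 0.95, "A3": 1.0},
    "drugY": {"A1": 1.0, "A2": 0.1, "A3": 1.0},
}
matches = matching_known_drugs(test_compound, known_drugs)
```

A non-empty result would indicate that the compound shares its pattern with a previously identified drug, in which case further development may be halted as described above.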
5 Additionally, the nucleotide sequence of the gene product described herein in a strain which 

proliferated less rapidly (or a strain which was not recovered from the culture) in the assays 
described above is compared to the nucleotide sequence of gene products from heterologous 
organisms to determine the likely spectrum of species whose growth would be inhibited by the 
compound. If the gene product has a high degree of homology to gene products from heterologous 

10 species, it is likely that the compound would also inhibit the growth of these heterologous species. 
Homology may be determined using any of a variety of methods familiar to those skilled in the art. 
For example, homology may be determined using a computer program such as BLASTP or 
FASTA. The ability of the compound to inhibit the growth of the heterologous species may then be 
confirmed by comparing the growth of cells of the heterologous species in the presence and absence 

15 of the compound. 

In other embodiments, the present invention uses collections or cultures of strains 
comprising both strains which overexpress gene products described herein required for cellular 
proliferation and strains which underexpress the same gene products required for cellular 
proliferation. The gene product which is overexpressed or underexpressed in each strain may be a 

20 gene product whose activity or level is inhibited by a nucleic acid comprising a nucleotide sequence 
selected from the group consisting of SEQ ID NOs.: 1-6213, a gene product encoded by a nucleic
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs.: 6214- 
42397, a gene product comprising an amino acid sequence selected from the group consisting of 
SEQ ID NOs.: 42398-78581, a gene product whose activity or level is inhibited by a homologous 

25 antisense nucleic acid, a gene product encoded by a homologous coding nucleic acid, and a gene 
product comprising a homologous polypeptide. 

The culture or collection of strains is contacted with a compound and the nucleic acids 
present in the culture or collection of strains are analyzed. Preferably, nucleic acids derived from 
overexpressing strains can be distinguished from those derived from underexpressing strains. For 

30 example, the overexpressing strains may be obtained using promoter replacement as described 
above while the underexpressing strains may be obtained by expressing antisense nucleic acids. 
Accordingly, in one embodiment, amplification primers may be designed which will uniquely 
amplify nucleic acids from the overexpressing strains or the underexpressing strains. If a 
compound acts on a gene product which was overexpressed and underexpressed in the culture, then 

the amplification product obtained from the strain in the culture or collection which overexpressed
the gene product will be overrepresented in the culture or collection while the amplification product
obtained from the strain which underexpressed the gene product will be underrepresented in the 
culture or collection. If desired, nucleic acids from a culture or collection which was contacted with 
the compound may be compared to nucleic acids from a control culture or collection which was not
contacted with the compound. Alternatively, nucleic acids from a culture or collection which was 
contacted with the compound may be directly analyzed without comparison to a control culture or 
collection. 
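
By way of illustration only, the comparison of amplification products from a compound-treated culture with those from a control culture may be expressed as a log ratio per strain. The sketch below assumes that a numerical signal is available for each amplification product; the pseudocount, the minimum shift of one log2 unit and the gene names are hypothetical choices rather than values taught herein.

```python
import math


def log2_ratio(treated, control, pseudocount=1.0):
    """Log2 ratio of treated to control signal; the pseudocount avoids
    division by zero and is an illustrative assumption."""
    return math.log2((treated + pseudocount) / (control + pseudocount))


def candidate_targets(over_signal, under_signal, min_shift=1.0):
    """Return genes whose overexpressing strain is overrepresented and whose
    underexpressing strain is underrepresented after compound treatment.

    Both arguments map gene -> (treated_signal, control_signal)."""
    hits = []
    for gene in over_signal.keys() & under_signal.keys():
        over = log2_ratio(*over_signal[gene])
        under = log2_ratio(*under_signal[gene])
        if over >= min_shift and under <= -min_shift:
            hits.append(gene)
    return sorted(hits)


# Hypothetical amplification-product signals (treated, control).
over = {"geneA": (400, 100), "geneB": (100, 100)}
under = {"geneA": (20, 100), "geneB": (95, 100)}
targets = candidate_targets(over, under)
```

Requiring shifts in opposite directions for the two strains of the same gene is one possible way to reduce false positives; other criteria could equally be used.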

5 In some embodiments, strains are constructed in which a nucleic acid complementary to a 

gene encoding a gene product described herein required for proliferation or a portion thereof is 
operably linked to a regulatable promoter. For example, in some embodiments, the strains may 
transcribe an antisense nucleic acid selected from the group consisting of SEQ ID NOs.: 1-6213 or 
fragments thereof which inhibit proliferation or reduce the activity or level of the gene product 

10 encoded by the gene comprising a nucleotide sequence complementary to the antisense nucleic acid 
or homologous antisense nucleic acids or fragments thereof. In other embodiments, the strains may 
transcribe an antisense nucleic acid which reduces the activity or level of a gene product encoded 
by SEQ ID NOs.: 6214-42397, the polypeptides of SEQ ID NOs.: 42398-78581, homologous
coding nucleic acids or homologous polypeptides. A culture comprising a plurality of such strains 

15 wherein each strain expresses an antisense nucleic acid against a different gene product required for 
proliferation is grown in the presence of varying levels of a compound which inhibits proliferation 
and in the presence of varying levels of an agent which regulates the level of transcription from the 
regulatable promoter. Nucleic acid samples are obtained from the culture, detectably labeled and
hybridized to a solid support comprising nucleic acids containing the genes encoding the 

20 proliferation-required gene products or a portion thereof. The level of hybridization is quantitated 
for each nucleic acid encoding each of the proliferation-required gene products to determine the rate 
at which each of the strains proliferated in the culture. If the antisense nucleic acid expressed by a 
strain in the culture is not complementary to all or a portion of the gene encoding the target of the 
compound (i.e. a nonspecific strain), then the hybridization intensity for that strain will not be 

25 correlated with the concentration of the compound (See Figure 4), while if the antisense nucleic 
acid expressed by a strain in the culture is complementary to all or a portion of the gene encoding 
the target of the compound, the hybridization intensity for that strain will be intimately correlated 
with the concentration of the compound (See Figure 5). In this manner, the target of the compound 
may be identified. It will be appreciated that, as described above, rather than growing the strains in 

30 a single culture, each strain may be grown in a different location on a solid medium or in a different 
well of a multiwell plate. 
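
The identification of the target strain from hybridization data can be sketched as a correlation calculation. The example below is a minimal sketch assuming that one hybridization intensity per strain is available at each compound concentration; it ranks strains by the magnitude of the Pearson correlation coefficient without assuming its sign. The strain names and values are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient; returns 0.0 if either series is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def rank_strains(concentrations, intensities):
    """Order strains from most to least strongly correlated with compound
    concentration (by absolute correlation)."""
    scores = {name: pearson(concentrations, values)
              for name, values in intensities.items()}
    return sorted(scores, key=lambda name: abs(scores[name]), reverse=True), scores


# Hypothetical data: compound concentration and per-strain hybridization intensity.
conc = [0, 1, 2, 4]
signal = {
    "strainA": [100, 80, 55, 10],    # intensity tracks the compound concentration
    "strainB": [100, 102, 98, 101],  # intensity is essentially independent of it
}
ranking, scores = rank_strains(conc, signal)
```

In this sketch the strain at the head of the ranking would be the candidate whose antisense nucleic acid is complementary to the gene encoding the target of the compound.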

The methods described above can be simultaneously performed for each of a large number 
of compounds. For example, the compounds may be members of a library of compounds generated 
using combinatorial chemistry or members of a natural product library. In such methods, a plurality 

35 of cultures each comprising a plurality of strains each of which overexpresses or underexpresses a 
different gene product required for proliferation or a plurality of collections of individual strains 
each of which overexpresses or underexpresses a different gene product required for proliferation is 
obtained. Each culture or collection of strains is contacted with a different compound in the library
and the target of the compound is identified as described above. 

In still another embodiment, the antisense nucleic acids of the present invention (including the 
antisense nucleic acids of SEQ ID NOs.: 1-6213, fragments thereof, or homologous antisense nucleic
acids or fragments thereof) that inhibit bacterial growth or proliferation can be used as antisense
therapeutics for killing bacteria. The antisense sequences can be complementary to one of SEQ ID 
NOs.: 6214-42397 or fragments thereof, homologous coding nucleic acids or fragments thereof. 
Alternatively, antisense therapeutics can be complementary to operons in which proliferation-required 
genes reside (i.e. the antisense nucleic acid may hybridize to a nucleotide sequence of any gene in the 

operon in which the proliferation-required genes reside). Further, antisense therapeutics can be
complementary to a proliferation-required gene or portion thereof with or without adjacent noncoding 
sequences, an intragenic sequence (i.e. a sequence within a gene), an intergenic sequence (i.e. a 
sequence between genes), a sequence spanning at least a portion of two or more genes, a 5' noncoding 
region or a 3' noncoding region located upstream or downstream from the actual sequence that is 

required for bacterial proliferation or an operon containing a proliferation-required gene.

In addition to therapeutic applications, the present invention encompasses the use of nucleic 
acids complementary to nucleic acids required for proliferation as diagnostic tools. For example, 
nucleic acid probes comprising nucleotide sequences complementary to proliferation-required 
sequences that are specific for particular species of cells or microorganisms can be used as probes to 

20 identify particular microorganism species or cells in clinical specimens. This utility provides a rapid 
and dependable method by which to identify the causative agent or agents of a bacterial infection. This 
utility would provide clinicians the ability to accurately identify the species responsible for the 
infection and administer a compound effective against it. In an extension of this utility, antibodies
generated against proteins translated from mRNA transcribed from proliferation-required sequences 

25 can also be used to screen for specific cells or microorganisms that produce such proteins in a species- 
specific manner. 

Other embodiments of the present invention include methods of identifying compounds which 
inhibit the activity of gene products required for cellular proliferation using rational drug design. As 
discussed in more detail below, in such methods, the structure of the gene product is determined using 

30 techniques such as x-ray crystallography or computer modeling. Compounds are screened to identify 
those which have a structure which would allow them to interact with the gene product or a portion 
thereof to inhibit its activity. The compounds may be obtained using any of a variety of methods 
familiar to those skilled in the art, including combinatorial chemistry. In some embodiments, the 
compounds may be obtained from a natural product library. In some embodiments, compounds having 

35 a structure which allows them to interact with the active site of a gene product, such as the active site of 
an enzyme, or with a portion of the gene product which interacts with another biomolecule to form a 
complex are identified. If desired, lead compounds may be identified and further optimized to provide 
compounds which are highly effective against the gene product. 
The following examples teach the genes of the present invention and a subset of uses for the
genes identified as required for proliferation. These examples are illustrative only and are not intended 
to limit the scope of the present invention. 

EXAMPLES 

The following examples are directed to the identification and exploitation of genes required for

proliferation. Methods of gene identification are discussed as well as a variety of methods to utilize the 
identified sequences. It will be appreciated that any of the antisense nucleic acids, proliferation-
required genes or proliferation-required gene products described herein, or portions thereof, may be
used in the procedures described below, including the antisense nucleic acids of SEQ ID NOs.: 1-6213,

the nucleic acids of SEQ ID NOs.: 6214-42397, or the polypeptides of SEQ ID NOs.: 42398-78581.
Likewise, homologous antisense nucleic acids, homologous coding nucleic acids, homologous 
polypeptides or portions of any of the above-mentioned nucleic acids or polypeptides, may be used in 
any of the procedures described below.

Genes Identified as Required for Proliferation of Escherichia coli, Staphylococcus aureus,
Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa and Salmonella
typhimurium.

Genomic fragments were operably linked to an inducible promoter in a vector and assayed for 
growth inhibition activity. Example 1 describes the examination of a library of genomic fragments 
cloned into vectors comprising inducible promoters. Upon induction with xylose or IPTG, the vectors 

20 produced an RNA molecule corresponding to the subcloned genomic fragments. In those instances 
where the genomic fragments were in an antisense orientation with respect to the promoter, the 
transcript produced was complementary to at least a portion of an mRNA (messenger RNA) encoding 
an Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae,
Pseudomonas aeruginosa or Salmonella typhimurium gene product such that it interacted with
sense mRNA produced from various Escherichia coli, Staphylococcus aureus, Enterococcus
faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa or Salmonella typhimurium genes and
thereby decreased the translation efficiency or the level of the sense messenger RNA thus decreasing 
production of the protein encoded by these sense mRNA molecules. In cases where the sense mRNA 
encoded a protein required for proliferation, bacterial cells containing a vector from which transcription 

30 from the promoter had been induced failed to grow or grew at a substantially reduced rate. 
Additionally, in cases where the transcript produced was complementary to at least a portion of a non- 
translated RNA and where that non-translated RNA was required for proliferation, bacterial cells 
containing a vector from which transcription from the promoter had been induced also failed to grow or 
grew at a substantially reduced rate. In contrast, cells grown under non-inducing conditions grew at a
normal rate.

The above method was used to identify genes required for cellular proliferation in Escherichia 
coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas 
aeruginosa and Salmonella typhimurium. Additionally, a number of genes required for cellular 
proliferation in Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella
pneumoniae, Pseudomonas aeruginosa and Salmonella typhimurium, which have been described in
the following U.S. Patent Applications: U.S. Patent Application Serial Number 09/492,709, filed
January 27, 2000; U.S. Patent Application Serial Number 09/711,164, filed November 9, 2000;
U.S. Patent Application Serial Number 09/741,669, filed December 19, 2000; U.S. Patent
Application Serial Number 09/815,242, filed March 21, 2001; and U.S. Provisional Patent Application
Serial Number 60/342,923, filed October 25, 2001, have been previously identified using the above
method.

EXAMPLE 1 

Inhibition of Bacterial Proliferation after Induction of Antisense Expression

To identify genes required for proliferation of E. coli, random genomic fragments were 
cloned into the IPTG-inducible expression vector pLEX5BA (Krause et al., J. Mol. Biol. 274: 365
(1997)) or a modified version of pLEX5BA, pLEX5BA-3', in which a synthetic linker containing a
T7 terminator was ligated between the PstI and HindIII sites of pLEX5BA. In particular, to
construct pLEX5BA-3', the following oligonucleotides were annealed and inserted into the PstI and
HindIII sites of pLEX5BA:

5'-GTCTAGCATAACCCCTTGGGGCCTCTAAACGGGTCCTTGAGGGGTTTTTTGA-3' (SEQ ID NO: 78584)

5'-AGCTTCAAAAAACCCCTCAAGGACCCGTTTAGAGGCCCCAAGGGGTTATGCTAGACTGCA-3' (SEQ ID NO: 78585)
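
As a consistency check on the two oligonucleotides given above (written 5' to 3' with spacing removed), the following sketch confirms that the shorter strand is the reverse complement of the interior of the longer strand, and reports the unpaired ends that remain on the longer strand after annealing. The function names are illustrative only.

```python
# Oligonucleotides of SEQ ID NOs: 78584 and 78585, written 5'->3'.
TOP = "GTCTAGCATAACCCCTTGGGGCCTCTAAACGGGTCCTTGAGGGGTTTTTTGA"
BOTTOM = "AGCTTCAAAAAACCCCTCAAGGACCCGTTTAGAGGCCCCAAGGGGTTATGCTAGACTGCA"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def annealed_overhangs(top, bottom):
    """Return the (5' end, 3' end) single-stranded extensions of `bottom`
    when `top` anneals within it, or None if the strands do not pair."""
    core = reverse_complement(top)
    start = bottom.find(core)
    if start < 0:
        return None
    return bottom[:start], bottom[start + len(core):]


overhangs = annealed_overhangs(TOP, BOTTOM)
```

The resulting four-nucleotide extensions, AGCT at the 5' end and TGCA at the 3' end of the longer strand, correspond to the cohesive ends left by HindIII and PstI, respectively, which is consistent with insertion of the annealed linker between those two sites.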

Random fragments of E. coli genomic DNA were generated by DNase I digestion or
sonication, filled in with T4 polymerase, and cloned into the SmaI site of pLEX5BA or pLEX5BA-3'.
Upon activation or induction, the promoter transcribed the random genomic fragments.

A number of vectors which allow the production of transcripts which have an extended 

lifetime in E. coli as well as other Gram negative bacteria can also be utilized in conjunction with
these antisense inhibition experiments. Such vectors are described in U.S. Provisional Patent 
Application Serial Number 60/343,512, filed December 21, 2001. Briefly, the stabilized antisense 
RNA may comprise an antisense RNA which was identified as inhibiting proliferation as described 
above which has been engineered to contain at least one stem loop flanking each end of the 

30 antisense nucleic acid. In some embodiments, the at least one stem-loop structure formed at the 5' 
end of the stabilized antisense nucleic acid comprises a flush, double stranded 5' end. In some 
embodiments, one or more of the stem loops comprises a rho independent terminator. In additional 
embodiments, the stabilized antisense RNA lacks a ribosome binding site. In further embodiments, 
the stabilized RNA lacks sites which are cleaved by one or more RNases, such as RNase E or
RNase III. In some embodiments, the stabilized antisense RNA may be transcribed in a cell in which
the activity of at least one enzyme involved in RNA degradation has been reduced. For example,
the activity of an enzyme such as RNase E, RNase II, RNase III, polynucleotide phosphorylase,
poly(A) polymerase, RNA helicase, enolase or an enzyme having similar functions may be reduced
in the cell. 






WO 02/077183 PCT/US02/09107 

To study the effects of transcriptional induction in liquid medium, growth curves were carried 
out by back diluting cultures 1:200 into fresh media with or without 1 mM IPTG and measuring the 
OD450 every 30 minutes (min). To study the effects of transcriptional induction on solid medium, 10^2,
10^3, 10^4, 10^5, 10^6, 10^7 and 10^8 fold dilutions of overnight cultures were prepared. Aliquots of from 0.5
to 3 µl of these dilutions were spotted on selective agar plates with or without 1 mM IPTG. After
overnight incubation, the plates were compared to assess the sensitivity of the clones to IPTG. 

Of the numerous clones tested, some clones were identified as containing a sequence that 
inhibited E. coli growth after IPTG induction. Accordingly, the gene to which the inserted nucleic acid 
sequence corresponds, or a gene within the operon containing the inserted nucleic acid, is required for 

proliferation in E. coli.

Nucleic acids involved in proliferation of Staphylococcus aureus, Enterococcus faecalis, 
Klebsiella pneumoniae, Pseudomonas aeruginosa and Salmonella typhimurium were identified as 
follows. Randomly generated fragments of Staphylococcus aureus, Enterococcus faecalis, 
Klebsiella pneumoniae, Pseudomonas aeruginosa or Salmonella typhimurium genomic DNA were 

transcribed from inducible promoters.

In the case of Staphylococcus aureus, a novel inducible promoter system, XylT5, 
comprising a modified T5 promoter fused to the xylO operator from the xylA promoter of
Staphylococcus aureus was used. The promoter is described in U.S. Patent Application Serial 
Number 10/032,393, filed December 21, 2001. Transcription from this hybrid promoter is 

20 inducible by xylose. 

Randomly generated fragments of Salmonella typhimurium genomic DNA were 
transcribed from an IPTG inducible promoter in pLEX5BA (Krause et al., J. Mol. Biol. 274: 365
(1997)) or a derivative thereof. Randomly generated fragments of Klebsiella pneumoniae genomic
DNA were expressed from an IPTG inducible promoter in pLEX5BA-Kan. To construct

pLEX5BA-Kan, pLEX5BA was digested to completion with ClaI in order to remove the bla gene.
Then the plasmid was treated with a partial NotI digestion and blunted with T4 DNA polymerase. A
3.2 kbp fragment was then gel purified and ligated to a blunted 1.3 kbp kan gene from pKaror. Kan
resistant transformants were selected on Kan plates. Orientation of the kan gene was checked by
SmaI digestion. A clone, which had the kan gene in the same orientation as the bla gene, was used

to identify genes required for proliferation of Klebsiella pneumoniae. Randomly generated
fragments of Pseudomonas aeruginosa genomic DNA were transcribed from a two-component
inducible promoter system. Integrated on the chromosome was the T7 RNA polymerase gene
regulated by lacUV5/lacO (Brunschwig, E. and Darzins, A. 1992. Gene 111:35-41). On a separate
plasmid, a T7 gene 10 promoter, which is transcribed by T7 RNA polymerase, was fused with a 

35 lacO operator followed by a multiple cloning site. 
Should the genomic DNA downstream of the promoter contain, in an antisense orientation,
at least a portion of an mRNA or a non-translated RNA encoding a gene product involved in 
proliferation, then induction of transcription from the promoter will result in detectable inhibition of 
proliferation. 

In the case of Staphylococcus aureus, a shotgun library of Staphylococcus aureus genomic
fragments was cloned into the vector pXylT5-P15a, which harbors the XylT5 inducible promoter.
The vector was linearized at a unique BamHI site immediately downstream of the XylT5
promoter/operator. The linearized vector was treated with shrimp alkaline phosphatase to prevent 
reclosure of the linearized ends. Genomic DNA isolated from Staphylococcus aureus strain RN450 

was fully digested with the restriction enzyme Sau3A or, alternatively, partially digested with
DNase I and "blunt-ended" by incubating with T4 DNA polymerase. Random genomic fragments 
between 200 and 800 base pairs in length were selected by gel purification. The size-selected 
genomic fragments were added to the linearized and dephosphorylated vector at a molar ratio of 0.1 
to 1, and ligated to form a shotgun library. 
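
For illustration, the mass of size-selected insert needed to reach a stated insert-to-vector molar ratio can be estimated from the relative lengths of the two molecules. The sketch below uses the standard proportionality between mass and length for double-stranded DNA; the vector length of 5000 base pairs, the vector mass of 100 ng and the mean insert length of 500 base pairs are hypothetical values chosen for the example and are not stated herein.

```python
def insert_mass_ng(vector_ng, vector_bp, insert_bp, insert_to_vector_ratio):
    """Mass of insert (ng) giving the requested insert:vector molar ratio.

    Moles of double-stranded DNA are proportional to mass / length, so the
    insert mass scales with the insert/vector length ratio."""
    return vector_ng * (insert_bp / vector_bp) * insert_to_vector_ratio


# Hypothetical ligation: 100 ng of a 5000 bp vector, 500 bp mean insert length.
low_ratio_ng = insert_mass_ng(100, 5000, 500, 0.1)   # 0.1 to 1 molar ratio
high_ratio_ng = insert_mass_ng(100, 5000, 500, 2.0)  # 2 to 1 molar ratio
```

Because the inserts span 200 to 800 base pairs, any single mean length is an approximation, and the calculated mass should be treated as a starting point only.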

The ligated products were transformed into electrocompetent E. coli strain XL1-Blue MRF'
(Stratagene) and plated on LB medium supplemented with carbenicillin at 100 µg/ml.
Resulting colonies numbering 5 x 10^5 or greater were scraped and combined, and were then
subjected to plasmid purification. 

The purified library was then transformed into electrocompetent Staphylococcus aureus 

RN4220. Resulting transformants were plated on agar containing LB + 0.2% glucose (LBG
medium) + chloramphenicol at 15 µg/ml (LBG+CM15 medium) in order to generate 100 to 150
platings at 500 colonies per plating. The colonies were subjected to robotic picking and arrayed
into wells of 384 well culture dishes. Each well contained 100 µl of LBG + CM15 liquid medium.
Inoculated 384 well dishes were incubated 16 hours at 37°C, and each well was robotically gridded
onto solid LBG + CM15 medium with or without 2% xylose. Gridded plates were incubated 16
hours at 37°C, and then manually scored for arrayed colonies that were growth-compromised in the 
presence of xylose. 

Arrayed colonies that were growth-sensitive on medium containing 2% xylose, yet were 
able to grow on similar medium lacking xylose, were subjected to further growth sensitivity 

30 analysis as follows: Colonies from the plate lacking xylose were manually picked and inoculated 
into individual wells of a 96 well culture dish containing LBG + CM15, and were incubated for 16 
hours at 37°C. These cultures were robotically diluted 1/100 into fresh medium and allowed to 
incubate for 4 hours at 37°C, after which they were subjected to serial dilutions in a 384 well array 
and then gridded onto media containing 2% xylose or media lacking xylose. After growth for 16 

35 hours at 37°C, the arrays that resulted on the two media were compared to each other. Clones that 
grew similarly at all dilutions on both media were scored as a negative and were no longer 
considered. Clones on xylose medium that failed to grow at the same serial dilution as on
the non-xylose plate were given a score based on the differential, i.e. should the clone grow at a
serial dilution of 10^4 or less on the xylose plate and grow at a serial dilution of 10^8 or less on the
non-xylose plate, then the corresponding clone received a score of "4" representing the log
difference in growth observed.
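
The scoring described above amounts to the difference between the base-ten exponents of the highest serial dilutions at which growth is observed on the two media. A minimal sketch follows, assuming each clone is summarized by those two exponents; the function name and the convention of returning zero for clones that are not sensitive are illustrative assumptions.

```python
def growth_sensitivity_score(max_exp_inducing, max_exp_noninducing):
    """Log difference in growth between non-inducing and inducing media.

    Each argument is the exponent n of the highest 10^n-fold dilution at
    which the clone still grew on that medium. Clones that grow equally
    well (or better) on inducing medium score 0 in this sketch."""
    return max(0, max_exp_noninducing - max_exp_inducing)


# A clone growing to the 10^4 dilution with inducer and to the 10^8 dilution
# without inducer is scored as in the text.
score = growth_sensitivity_score(4, 8)
```

A clone with identical growth on both media would score zero under this convention and would be treated as a negative.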

For Salmonella typhimurium and Klebsiella pneumoniae, growth curves were carried out by
back diluting cultures 1:200 into fresh media containing 1 mM IPTG or media lacking IPTG and
measuring the OD450 every 30 minutes (min). To study the effects of transcriptional induction on solid
medium, 10^2, 10^3, 10^4, 10^5, 10^6, 10^7 and 10^8 fold dilutions of overnight cultures were prepared.
Aliquots of from 0.5 to 3 µl of these dilutions were spotted on selective agar plates with or without 1
mM IPTG. After overnight incubation, the plates were compared to assess the sensitivity of the clones
to IPTG.

Nucleic acids involved in proliferation of Pseudomonas aeruginosa were identified as
follows. Randomly generated fragments of Pseudomonas aeruginosa genomic DNA were
transcribed from a two-component inducible promoter system. Integrated on the chromosome was
the T7 RNA polymerase gene regulated by lacUV5/lacO (Brunschwig, E. and Darzins, A. 1992.
Gene 111:35-41). On an expression plasmid there was a T7 gene 10 promoter, which is transcribed
by T7 RNA polymerase, fused with a lacO operator followed by a multiple cloning site.
Transcription from this hybrid promoter is inducible by IPTG. Should the genomic DNA 
downstream of the promoter contain, in an antisense orientation, at least a portion of an mRNA 
encoding a gene product involved in proliferation, then induction of expression from the promoter 

20 will result in detectable inhibition of proliferation. 

A shotgun library of Pseudomonas aeruginosa genomic fragments was cloned into the
vectors pEP5, pEP5S, or other similarly constructed vectors which harbor the T7lacO inducible
promoter. The vector was linearized at a unique SmaI site immediately downstream of the T7lacO
promoter/operator. The linearized vector was treated with shrimp alkaline phosphatase to prevent
reclosure of the linearized ends. Genomic DNA isolated from Pseudomonas aeruginosa strain
PAO1 was partially digested with DNase I and "blunt-ended" by incubating with T4 DNA
polymerase. Random genomic fragments between 200 and 800 base pairs in length were selected 
by gel purification. The size-selected genomic fragments were added to the linearized and 
dephosphorylated vector at a molar ratio of 2 to 1, and ligated to form a shotgun library. 

The ligated products were transformed into electrocompetent E. coli strain XL1-Blue MRF'
(Stratagene) and plated on LB medium with carbenicillin at 100 µg/ml or streptomycin at 100 µg/ml.
Resulting colonies numbering 5 x 10^5 or greater were scraped and combined, and were then
subjected to plasmid purification. 

The purified library was then transformed into electrocompetent Pseudomonas aeruginosa
strain PAO1. Resulting transformants were plated on LB agar with carbenicillin at 100 µg/ml or
streptomycin at 40 µg/ml in order to generate 100 to 150 platings at 500 colonies per plating. The
colonies were subjected to robotic picking and arrayed into wells of 384 well culture dishes. Each
well contained 100 µl of LB + CB100 or Streptomycin 40 liquid medium. Inoculated 384 well
dishes were incubated 16 hours at room temperature, and each well was robotically gridded onto
solid LB + CB100 or Streptomycin 40 medium with or without 1 mM IPTG. Gridded plates were
incubated 16 hours at 37°C, and then manually scored for arrayed colonies that were growth- 
compromised in the presence of IPTG. 
Arrayed colonies that were growth-sensitive on medium containing 1 mM IPTG, yet were

able to grow on similar medium lacking IPTG, were subjected to further growth sensitivity analysis 
as follows: Colonies from the plate lacking IPTG were manually picked and inoculated into 
individual wells of a 96 well culture dish containing LB + CB100 or Streptomycin 40, and were 
incubated for 16 hours at 30°C. These cultures were robotically diluted 1/100 into fresh medium 

10 and allowed to incubate for 4 hours at 37°C, after which they were subjected to serial dilutions in a 
384 well array and then gridded onto media with and without 1 mM IPTG. After growth for 16 
hours at 37°C, the arrays of serially diluted spots that resulted were compared between the two 
media. Clones that grew similarly at all dilutions on both media were scored as a negative and were 
no longer considered. Clones on IPTG medium that failed to grow at the same serial
dilution as on the non-IPTG plate were given a score based on the differential, i.e. should the clone
grow at a serial dilution of 10^4 or less on the IPTG plate and grow at a serial dilution of 10^8 or less
on the non-IPTG plate, then the corresponding clone received a score of "4" representing the log
difference in growth observed.

Following the identification of those vectors that, upon induction, negatively impacted 

20 Pseudomonas aeruginosa growth or proliferation, the inserts or nucleic acid fragments contained in 
those vectors were isolated for subsequent characterization. Vectors of interest were subjected to 
nucleic acid sequence determination. 

Nucleic acids involved in proliferation of E. faecalis were identified as follows. Randomly 
generated fragments of genomic DNA were expressed from the vectors pEPEF3 or pEPEF14, 

25 which contain the CP25 or P59 promoter, respectively, regulated by the xyl operator/repressor. 
These plasmids as well as other vectors useful for the expression of nucleic acids in Enterococcus 
faecalis and other Gram positive organisms are described in U.S. Patent Application Serial Number 
10/032,393, filed December 21, 2001, the disclosure of which is incorporated herein by reference
in its entirety. Should the genomic DNA downstream of the promoter contain, in an antisense 

30 orientation, at least a portion of a mRNA encoding a gene product involved in proliferation, then 
induction of expression from the promoter will result in detectable inhibition of proliferation. 

A shotgun library of E. faecalis genomic fragments was cloned into the vector pEPEF3 or 
pEPEF14, which harbor xylose inducible promoters. The vector was linearized at a unique SmaI
site immediately downstream of the promoter/operator. The linearized vector was treated with 

35 alkaline phosphatase to prevent reclosure of the linearized ends. Genomic DNA isolated from E. 
faecalis strain OG1RF was partially digested with DNase I and "blunt-ended" by incubating with 
T4 DNA polymerase. Random genomic fragments between 200 and 800 base pairs in length were 
selected by gel purification. The size-selected genomic fragments were added to the linearized and
dephosphorylated vector at a molar ratio of 2 to 1, and ligated to form a shotgun library. 

The ligated products were transformed into electrocompetent E. coli strain TOP10 cells 
(Invitrogen) and plated on LB medium with erythromycin (Erm) at 150 µg/ml. Resulting colonies
numbering 5 x 10^5 or greater were scraped and combined, and were then subjected to plasmid
purification. 

The purified library was then transformed into electrocompetent E. faecalis strain OG1RF. 
Resulting transformants were plated on Todd-Hewitt (TH) agar with erythromycin at 10 µg/ml in
order to generate 100 to 150 platings at 500 colonies per plating. The colonies were subjected to
robotic picking and arrayed into wells of 384 well culture dishes. Each well contained 100 µl of
THB + Erm 10 µg/ml. Inoculated 384 well dishes were incubated 16 hours at room temperature,
and each well was robotically gridded onto solid TH agar + Erm with or without 5% xylose.
Gridded plates were incubated 16 hours at 37°C, and then manually scored for arrayed colonies that 
were growth-compromised in the presence of xylose. 
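
As a rough check on the scale of the screen described above, the plating figures can be turned into colony and dish counts. This is a sketch under the assumption, not stated in the text, that every colony is picked into its own well; the function name is illustrative.

```python
import math


def colonies_and_dishes(platings, colonies_per_plating=500, wells_per_dish=384):
    """Return (total colonies, 384 well dishes needed) for a number of platings.

    Assumes one picked colony per well and that every colony is picked.
    """
    colonies = platings * colonies_per_plating
    dishes = math.ceil(colonies / wells_per_dish)
    return colonies, dishes


if __name__ == "__main__":
    for platings in (100, 150):
        colonies, dishes = colonies_and_dishes(platings)
        print(f"{platings} platings: {colonies} colonies, {dishes} dishes")
```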
Arrayed colonies that were growth-sensitive on medium containing 5% xylose, yet were
able to grow on similar medium lacking xylose, were subjected to further growth sensitivity
analysis. Colonies from the plate lacking xylose were manually picked and inoculated into
individual wells of a 96 well culture dish containing THB + Erm 10, and were incubated for 16
hours at 30°C. These cultures were robotically diluted 1/100 into fresh medium and allowed to
incubate for 4 hours at 37°C, after which they were subjected to serial dilution on plates containing
5% xylose or plates lacking xylose. After growth for 16 hours at 37°C, the arrays of serially diluted
spots that resulted were compared between the two media. Colonies that grew similarly on both
media were scored as negative, and the corresponding colonies were no longer considered. Colonies
on xylose medium that failed to grow to the same serial dilution as those on the non-xylose
plate were given a score based on the differential. For example, if colonies on xylose medium
grew only to a serial dilution of -4 while they were able to grow to -8 on the non-xylose plate,
the corresponding transformant colony received a score of "4," representing the log difference
in growth observed.
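
The scoring rule in the preceding paragraph can be sketched as a small function. This is an illustration only: the function and variable names are hypothetical, dilutions are represented as log10 exponents (for example -4 for a 10^-4 dilution), and clamping the score at zero for clones that grow equally well on both media is an assumption made here to represent a negative result.

```python
def dilution_plating_score(last_growth_with_xylose, last_growth_without_xylose):
    """Log difference between the last dilutions at which growth was seen.

    Both arguments are log10 dilution exponents, e.g. -4 and -8.
    A clone that grows equally on both media scores 0 (a negative).
    """
    difference = last_growth_with_xylose - last_growth_without_xylose
    return max(difference, 0)


if __name__ == "__main__":
    # The worked example from the text: growth to -4 on xylose, -8 without.
    print(dilution_plating_score(-4, -8))
    # A clone growing to the same dilution on both media is a negative.
    print(dilution_plating_score(-8, -8))
```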

Following the identification of those vectors that, upon induction, negatively impacted E.
faecalis growth or proliferation, the inserts or nucleic acid fragments contained in those expression
vectors were isolated for subsequent characterization. The inserts in the vectors of interest were
subjected to nucleotide sequence determination.

It will be appreciated that other restriction enzymes and other endonucleases or
methodologies may be used to generate random genomic fragments. In addition, random genomic
fragments may be generated by mechanical shearing. Sonication and nebulization are two such
techniques commonly used for mechanical shearing of DNA.

EXAMPLE 2

Nucleotide Sequence Determination of Identified Clones Transcribing Nucleic Acid Fragments with
Detrimental Effects on Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella
pneumoniae, Pseudomonas aeruginosa or Salmonella typhimurium Proliferation

Plasmids from clones that received a dilution plating score of "2" or greater were isolated to
obtain the genomic DNA insert responsible for growth inhibition as follows.

The nucleotide sequences of the nucleic acid sequences which inhibited the growth of
Escherichia coli were determined using plasmid DNA isolated using QIAPREP (Qiagen, Valencia,
CA) and methods supplied by the manufacturer. The primers used for sequencing the inserts were
5' - TGTTTATCAGACCGCTT - 3' (SEQ ID NO: 78586) and 5' - ACAATTTCACACAGCCTC - 3'
(SEQ ID NO: 78587). These sequences flank the polylinker in pLEX5BA.

The nucleotide sequences of the nucleic acid sequences which inhibited the growth of
Staphylococcus aureus were determined as follows. Staphylococcus aureus were grown in standard
laboratory media (LB or TB with 15 µg/ml chloramphenicol to select for the plasmid). Growth
was carried out at 37°C overnight in culture tubes or 2 ml deep well microtiter plates.

Lysis of Staphylococcus aureus was performed as follows. Cultures (2-5 ml) were
centrifuged and the cell pellets resuspended in a 1.5 mg/ml solution of lysostaphin (20 µl/ml of
original culture) followed by addition of 250 µl of resuspension buffer (Qiagen). Alternatively, cell
pellets were resuspended directly in 250 µl of resuspension buffer (Qiagen) to which 5-20 µl of a 1
mg/ml lysostaphin solution were added.

DNA was isolated using Qiagen miniprep kits or Wizard (Qiagen) miniprep kits according
to the instructions provided by the manufacturer.

The genomic DNA inserts were amplified from the purified plasmids by PCR as follows.
1 µl of Qiagen purified plasmid was put into a total reaction volume of 25 µl Qiagen Hot
Start PCR mix. For Staphylococcus aureus, the following primers were used in the PCR reaction:
P XylT5F: CAGCAGTCTGAGTTATAAAATAG (SEQ ID NO: 78588)
LexL: TGTTTTATCAGACCGCTT (SEQ ID NO: 78589)

Similar methods were conducted for Salmonella typhimurium and Klebsiella pneumoniae.
For Salmonella typhimurium and Klebsiella pneumoniae the following primers were used:
5' - TGTTTTATCAGACCGCTT - 3' (SEQ ID NO: 78589) and
5' - ACAATTTCACACAGCCTC - 3' (SEQ ID NO: 78587)
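
Basic properties of the two primers listed above can be checked with a few lines of code. The sketch below is illustrative only: the helper names are hypothetical, and the melting temperature uses the simple Wallace rule of thumb (2°C per A or T, 4°C per G or C), which is a crude estimate for short oligonucleotides and is not a method described in this example.

```python
PRIMERS = {
    "SEQ ID NO: 78589": "TGTTTTATCAGACCGCTT",
    "SEQ ID NO: 78587": "ACAATTTCACACAGCCTC",
}

_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def gc_count(seq):
    """Number of G and C bases in the sequence."""
    return sum(1 for base in seq.upper() if base in "GC")


def wallace_tm(seq):
    """Rule-of-thumb melting temperature (deg C): 2*(A+T) + 4*(G+C)."""
    gc = gc_count(seq)
    at = len(seq) - gc
    return 2 * at + 4 * gc


def reverse_complement(seq):
    """Reverse complement, i.e. the opposite strand read 5' to 3'."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq.upper()))


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, len(seq), "nt,", gc_count(seq), "G/C,",
              "Wallace Tm approx.", wallace_tm(seq), "C,",
              "reverse complement", reverse_complement(seq))
```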
PCR was carried out in a PE GenAmp with the following cycle times:
Step 1. 95° C 15 min
Step 2. 94° C 45 sec
Step 3. 54° C 45 sec
Step 4. 72° C 1 minute
Step 5. Return to step 2, 29 times
Step 6. 72° C 10 minutes
Step 7. 4° C hold

The PCR products were cleaned using Qiagen Qiaquick PCR plates according to the manufacturer's 
instructions. 
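
The amplification program listed above can be written down as data, which makes the cycle count and the programmed hold time easy to check. This is a sketch only: the class and function names are hypothetical, and the total excludes temperature ramping and the final 4° C hold, so it understates real run time.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int
    seconds: int


INITIAL_DENATURATION = Step(95, 15 * 60)           # Step 1
CYCLE = (Step(94, 45), Step(54, 45), Step(72, 60))  # Steps 2-4
REPEATS = 29                                        # Step 5: "Return to step 2, 29 times"
FINAL_EXTENSION = Step(72, 10 * 60)                 # Step 6


def total_cycles(repeats):
    """The first pass through steps 2-4 plus the programmed repeats."""
    return repeats + 1


def programmed_seconds(initial, cycle, repeats, final):
    """Sum of programmed hold times, ignoring ramp times and the 4 C hold."""
    per_cycle = sum(step.seconds for step in cycle)
    return initial.seconds + total_cycles(repeats) * per_cycle + final.seconds


if __name__ == "__main__":
    seconds = programmed_seconds(INITIAL_DENATURATION, CYCLE, REPEATS, FINAL_EXTENSION)
    print(total_cycles(REPEATS), "cycles,", seconds // 60, "minutes of programmed holds")
```

Read this way, returning to step 2 a further 29 times gives 30 passes through the denaturation, annealing, and extension steps.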

For Pseudomonas aeruginosa, plasmids from transformant colonies that received a dilution
plating score of "2" or greater were isolated to obtain the genomic DNA insert responsible for
growth inhibition as follows. Pseudomonas aeruginosa were grown in standard laboratory media
(LB with carbenicillin at 100 µg/ml or streptomycin at 40 µg/ml to select for the plasmid). Growth
was carried out at 30°C overnight in 100 µl culture wells in microtiter plates. To amplify insert
DNA, 2 µl of culture were placed into 25 µl Qiagen Hot Start PCR mix. PCR reactions were in 96
well microtiter plates. For plasmid pEP5S the following primers were used in the PCR reaction:

T7L1+: GTCGGCGATATAGGCGCCAGCAACCG (SEQ ID NO: 78590) 

P StrA3: ATAATCGAGCATGAGTATCATACG (SEQ ID NO: 78591) 

PCR was carried out in a PE GenAmp with the following cycle times:
Step 1. 95° C 15 min
Step 2. 94° C 45 sec
Step 3. 54° C 45 sec
Step 4. 72° C 1 minute
Step 5. Return to step 2, 29 times
Step 6. 72° C 10 minutes
Step 7. 4° C hold

The PCR products were cleaned using Qiagen Qiaquick PCR plates according to the manufacturer's 
instructions. 

The purified PCR products were then directly cycle sequenced with Qiagen Hot Start PCR
mix. The following primers were used in the sequencing reaction:
T7/L2: ATGCGTCCGGCGTAGAGGAT (SEQ ID NO: 78592)

PCR was carried out in a PE GenAmp with the following cycle times:
Step 1. 94° C 15 min
Step 2. 96° C 10 sec
Step 3. 50° C 5 sec
Step 4. 60° C 4 min
Step 5. Return to step 2, 24 times
Step 6. 4° C hold

The PCR products were cleaned using Qiagen Qiaquick PCR plates according to the manufacturer's 
instructions. 

For E. faecalis, plasmids from transformant colonies that received a dilution plating score
of "2" or greater were isolated to obtain the genomic DNA insert responsible for growth inhibition
as follows. E. faecalis were grown in THB 10 µg/ml Erm at 30°C overnight in 100 µl culture wells
in microtiter plates. To amplify insert DNA, 2 µl of culture were placed into 25 µl Qiagen Hot Start
PCR mix. PCR reactions were in 96 well microtiter plates. The following primers were used in the
PCR reaction:

pXylT5: CAGCAGTCTGAGTTATAAAATAG (SEQ ID NO: 78588) and the
pEP/pAK1 primer.

PCR was carried out in a PE GenAmp with the following cycle times: 
Step 1. 95° C 15 min 
Step 2. 94° C 45 sec 
Step 3. 54° C 45 sec 
Step 4. 72° C 1 minute
Step 5. Return to step 2, 29 times
Step 6. 72° C 10 minutes
Step 7. 4° C hold

The PCR products were cleaned using Qiagen Qiaquick PCR plates according to the manufacturer's
instructions.

The purified PCR products were then directly cycle sequenced with Qiagen Hot Start PCR
mix. The following primers were used in the PCR reaction:
pXylT5: CAGCAGTCTGAGTTATAAAATAG (SEQ ID NO: 78588)

PCR was carried out in a PE GenAmp with the following cycle times:
Step 1. 94° C 15 min
Step 2. 96° C 10 sec
Step 3. 50° C 5 sec
Step 4. 60° C 4 min
Step 5. Return to step 2, 24 times
Step 6. 4° C hold

The PCR products were cleaned using Qiagen Qiaquick PCR plates according to the
manufacturer's instructions.

The amplified genomic DNA inserts from each of the above procedures were subjected to
automated sequencing. Sequence identification numbers (SEQ ID NOs) and clone names for the
identified inserts are listed in Table IA and discussed below.
Clone Name


S1M10000025G06 

S1M10000025H06 

S1M10000025H07 

S1M10000025A08 

S1M10000025D08 

S1M10000025F08 

S1M10000025H08 

S1M10000025A09 

S1M10000025B09 

S1M10000025C09 

S1M10000025D09 

S1M10000025E09 

S1M10000025F09 

S1M10000025A10 

S1M10000025C10 

S1M10000025D10 

S1M10000025F10 

S1M10000025G10 

S1M10000025H10 

S1M10000025C11

S1M10000025E11 

S1M10000025B12 

S1M10000025F12 

S1M10000026C01 

S1M10000026E01 

S1M10000026F01 

S1M10000026G01 

S1M10000026H01 

S1M10000026A02 

S1M10000026B02 

S1M10000026H02 

S1M10000026B03 

S1M10000026F03 


SeqID


Clone Name


P1M10000105C04 

P1M10000105D04 

P1M10000105C05 

P1M10000105B06 

P1M10000105C08 

P1M10000105H08 

P1M10000105D09 

P1M10000110E01

P1M10000110F01

P1M10000110G01

P1M10000110B02 

P1M10000110B03 

P1M10000110F03

P1M10000110G03 

P1M10000110D04 

P1M10000110F04 

P1M10000110B05 

P1M10000110E05 

P1M10000110B07

P1M10000110B08 

P1M10000110F08 

P1M10000110A09 

P1M10000110E09 

P1M10000110F09 

P1M10000100F01

P1M10000098A02 

P1M10000098B02 

P1M10000098A03 

P1M10000098D03 

P1M10000098E04 

P1M10000098G04 

P1M10000098A05 

P1M10000098C05 


SeqID


Clone Name


E1M10000260G02 

E1M10000260F04 

E1M10000260A05 

E1M10000260C05 

E1M10000260E05 

E1M10000260C07 

E1M10000260G07 

E1M10000260B08 

E1M10000260D08 

E1M10000260E08 

E1M10000260E09

E1M10000260C10 

E1M10000260D10 

E1M10000260E10 

E1M10000260G10 

E1M10000260H10 

E1M10000260H11 

E1M10000260B12 

E1M10000260D12 

E1M10000260G12 

E1M10000261F01 

E1M10000261B02 

E1M10000261H02 

E1M10000261G04 

E1M10000261H05 

E1M10000261G06 

E1M10000261H06 

E1M10000261D08 

E1M10000261F08 

E1M10000261C09 

E1M10000261H09 

E1M10000261E10 

E1M10000262E01 


SeqID


Clone Name


P33-LC22 
X3S107-17 
P35-7 
X3SH8-9 
X3S163-1 
X3S204-7 
X3S177-4 
P342-3 
SC2L1 
SC17.1 
SC13.1 
MC9.6 
Z60-P16 
Z86-I21 
E1M10000109A02 
E1M10000109A11
E1M10000101F05 
E1M10000101D06 
E1M10000101A07 
E1M10000101H07 
E1M10000101H09 
E1M10000101C12 
E1M10000103B04 
E1M10000103D11
E1M10000110G01
E1M10000110H01
E1M10000110E09
E1M10000110A12
E1M10000112F05
E1M10000113F02
E1M10000113A11 
E1M10000111C03 
E1M10000111E04 


SeqID
Clone Name


S1M10000026G03 
S1M10000026H03 
S1M10000026A04 
S1M10000026D04 
S1M10000026F04 
S1M10000026G04 
S1M10000026H04 
S1M10000026A05 

S1M10000026D05 

S1M10000026F05 

S1M10000026G05 

S1M10000026H05 

S1M10000026A06 

S1M10000026B06 

S1M10000026C06 

S1M10000026D06 

S1M10000026F06 

S1M10000026G06 

S1M10000026A07 

S1M10000026B07 

S1M10000026C07 

S1M10000026D07 

S1M10000026F07 

S1M10000026G07 

S1M10000026H07 

S1M10000026A08 

S1M10000026C08 

S1M10000026D08 

S1M10000026F08 

S1M10000026G08 

S1M10000026A09 

S1M10000026E09 

S1M10000026G09 

S1M10000026H09 

S1M10000026A10 


SeqID


Clone Name


P1M10000098G06 

P1M10000098H06 

P1M10000098C07 

P1M10000098F07 

P1M10000098A08 

P1M10000098G08 

P1M10000098H09 

P1M10000098B11 

P1M10000098C12 

P1M10000099D01 

P1M10000099G03 

P1M10000099A09 

P1M10000099A10 

P1M10000099E10 

P1M10000099F10 

P1M10000099D11 

P1M10000106D02 

P1M10000106F03 

P1M10000106H03 

P1M10000106F04 

P1M10000106D05 

P1M10000106E07 

P1M10000107E02 

P1M10000107H02 

P1M10000107C03 

P1M10000107A04 

P1M10000107C04 

P1M10000107C09 

P1M10000107C10 

P1M10000107D10

P1M10000107H10 

P1M10000108C01 

P1M10000108A02 

P1M10000108B02 

P1M10000108A03 

P1M10000108D04 


SeqID



WO 02/077183 PCT/US02/09107



Clone Name 


S1M10000026B10 
S1M10000026D10 
S1M10000026E10 
S1M10000026F10 
S1M10000026G10 
S1M10000026H10
S1M10000026A11 
S1M10000026B11 
S1M10000026C11 
S1M10000026E11 
S1M10000026B12 
S1M10000026C12 
S1M10000026D12 
S1M10000026E12 
S1M10000026F12 
S1M10000026G12
S1M10000027G01 
S1M10000027C02 
S1M10000027D02 
S1M10000027E02 
S1M10000027F02 
S1M10000027H02 
S1M10000027D03 
S1M10000027E03 
S1M10000027G03 
S1M10000027A04 
S1M10000027C04 
S1M10000027G04 
S1M10000027H04
S1M10000027A05 
S1M10000027C05 
S1M10000027D05 
S1M10000027E05 
S1M10000027F05 
S1M10000027G05 
S1M10000027H05


SeqID


Clone Name


P1M10000108G04 

P1M10000108E05 

P1M10000108F05 

P1M10000108F06 

P1M10000108G06 

P1M10000109A02 

P1M10000109C03

P1M10000109E03 

P1M10000109D04 

P1M10000109A05 

P1M10000109B08 

P1M10000109H09 

P1M10000109E10 

P1M10000109F10 

P1M10000109E11 

P1M10000109B12 

S4M10000001C01 

S4M10000002G04 

S4M10000002B06 

S4M10000002G08 

S4M10000002B09 

S4M10000019H06 

S4M10000008H10 

S4M10000009E03 

S4M10000009C06 

S4M10000009E07 

S4M10000009G08 

S4M10000009B11

S4M10000009F11 

S4M10000009G11 

S4M10000010F04 

S4M10000010H04 

S4M10000010B05 

S4M10000010D07 

S4M10000010D08 

S4M10000010B09 


SeqID


Clone Name


E1M10000264B05 
E1M10000264B06 
E1M10000264G09 
E1M10000264D11 
E1M10000264F11 
E1M10000264H11
E1M10000264B12 

Jb 1 M 1 UUUUzO^K^ 1 £. 

E1M10000265A02 
E1M10000265E02 
E1M10000265G02 
E1M10000265D04 
E1M10000265F04 
E1M10000265E05
E1M10000265H05
E1M10000265C09 
E1M10000265E09 
E1M10000265F09 
E1M10000265H10 
E1M10000265A11 
E1M10000265B11 
E1M10000265C11 
E1M10000266D02 
E1M10000266H02 
E1M10000266F04 
E1M10000266H04 
E1M10000266H05 
E1M10000266B06 
E1M10000266F11 
E1M10000267F01 
E1M10000267E04 
E1M10000267A05
E1M10000267B05 
E1M10000267A07 
E1M10000267H07 
E1M10000267E09 


SeqID






Clone Name 


S1M10000027B06
S1M10000027C06 
S1M10000027D06 
S1M10000027E06 
S1M10000027F06 
S1M10000027G06 
S1M10000027H06 
S1M10000027B07 

S1M10000027E07 
S1M10000027G07 
S1M10000027H07 
S1M10000027A08 
S1M10000027B08 
S1M10000027C08 
S1M10000027D08 
S1M10000027E08 
S1M10000027F08 
S1M10000027G08 
S1M10000027H08 
S1M10000027B09 
S1M10000027C09 
S1M10000027D09 
S1M10000027E09 
S1M10000027F09 
S1M10000027G09 
S1M10000027H09 
S1M10000027D10 
S1M10000027H10 
S1M10000027A11 
S1M10000027B11 
S1M10000027D11 
S1M10000027E11 
S1M10000027G11 
S1M10000027H11 
S1M10000028B01


SeqID


Clone Name


S4M10000010C09 
S4M10000010D09 
S4M10000010D10 
S4M10000011F05 
S4M10000011D08 
S4M10000011A09 
S4M10000011F09 
S4M10000011E10 
S4M10000011F10 
S4M10000011D11 
S4M10000012H03 
S4M10000012B06 
S4M10000012B12 
S4M10000013D02 
S4M10000013H02 
S4M10000014H02 
S4M10000014B05 
S4M10000014D07 
S4M10000015E09 
S4M10000015B11 
S4M10000016A02 
S4M10000020F08 
S4M10000021E07 
S4M10000022B02 
S4M10000022D04 
S4M10000022B05 
S4M10000022G07 
S4M10000022D12 
S4M10000022E12 
S4M10000024G01 
S4M10000024G04 
S4M10000024C06 
S4M10000024F08 
S4M10000024G09 
S4M10000024C11 
S4M10000025E02


SeqID


Clone Name


E1M10000267G09 
E1M10000267H09 
E1M10000267A10 
E1M10000267E10 
E1M10000267C11 
E1M10000267E11 
E1M10000267B12 

E1M10000268F03 

E1M10000268D04 

E1M10000268E04 

E1M10000268F06 

E1M10000268E07 

E1M10000268A08 

E1M10000268B08 

E1M10000268D08 

E1M10000268G08 

E1M10000268B09 

E1M10000268E09 

E1M10000268F09 

E1M10000268G09 

E1M10000268E10 

E1M10000268A11 

E1M10000268G11 

E1M10000268G12 

E1M10000269D01 

E1M10000269B02 

E1M10000269D03 

E1M10000269D04 

E1M10000269H04 

E1M10000269B05 

E1M10000269D05 

E1M10000269H05 

E1M10000269A06 

E1M10000269E07 

E1M10000269F07 


SeqID








Clone Name 


S1M10000028F01
S1M10000028G01 
S1M10000028A02 
S1M10000028B02 
S1M10000028C02 
S1M10000028G02 
S1M10000028B03 

o 1 IVl 1 UUU UuZol/v J 

S1M10000028E03 
S1M10000028F03 
S1M10000028G03 
S1M10000028H03 
S1M10000028A04 
S1M10000028B04 
S1M10000028C04 
S1M10000028D04 
S1M10000028E04 
S1M10000028F04 
S1M10000028G04 
S1M10000028B05 
S1M10000028C05 
S1M10000028D05 
S1M10000028F05 
S1M10000028G05 
S1M10000028H05 
S1M10000028A06 
S1M10000028B06 
S1M10000028C06 
S1M10000028D06 
S1M10000028F06 
S1M10000028G06 
S1M10000028D07 
S1M10000028F07 
S1M10000028H07 
S1M10000028A08


SeqID


Clone Name


S4M10000025E05 

S4M10000025H07 

S4M10000025A11 

S4M10000025F12 

S4M10000026C01 

S4M10000026E03 

S4M10000026D04 

S4M10000026B10 

S4M10000026E12 

S4M10000027E02 

S4M10000027C10 

S4M10000029B12 

S4M10000029D12 

S4M10000030F06 

S4M10000030F07 

S4M10000032F01 

S4M10000032G01 

S4M10000032F03 

S4M10000034A02 

S4M10000034C05 

S4M10000034H05

S4M10000034A06 

S4M10000034A09 

S4M10000034H09 

S4M10000035B01 

S4M10000035D01 

S4M10000035F02 

S4M10000035E03 

S4M10000035B06 

S4M10000035A09 

S4M10000036F06 

S4M10000036B09 

S4M10000036H11 

S4M10000037A03

S4M10000037A08

S4M10000037H08


SeqID




Clone Name 


S1M10000028B08 
S1M10000028C08 
S1M10000028D08 
S1M10000028E08 
S1M10000028G08 
S1M10000028H08 
S1M10000028B09 
S1M10000028D09 
S1M10000029F01 
S1M10000029G01 
S1M10000029H01 
S1M10000029A02 
S1M10000029B02 
S1M10000029C02 
S1M10000029D02
S1M10000029E02
S1M10000029F02
S1M10000029G02 
S1M10000029A03 
S1M10000029B03 
S1M10000029C03 
S1M10000029G03 
S1M10000029A04 
S1M10000029B04 
S1M10000029F04 
S1M10000029G04 
S1M10000029B05 
S1M10000029C05 
S1M10000029D05 
S1M10000029E05 
S1M10000029G05 
S1M10000029H05 
S1M10000029B06 
S1M10000029H06 
S1M10000029C07
S1M10000029G07 


SeqID


Clone Name


S4M10000037E10 
S4M10000037F10 
S4M10000033G05 
S4M10000033F08 
S4M10000033G09 
S4M10000033B10 
S1M10000001D01
S1M10000001F01
S1M10000001D02
S1M10000001E02
S1M10000001F02
S1M10000001A04 
S1M10000001E04 
S1M10000001F04 
S1M10000001A05 
S1M10000001D06
S1M10000001D07 
S1M10000001A08 
S1M10000001F08 
S1M10000001A09 
S1M10000001E09 
S1M10000001F09 
S1M10000001A10
S1M10000001F10
S1M10000001G10
S1M10000001E11
S1M10000002B01
S1M10000002D01
S1M10000002E01
S1M10000002F01 
S1M10000002A02 
S1M10000002C02 
S1M10000002D02 
S1M10000002E02 
S1M10000002F02 
S1M10000002B03 


SeqID






Clone Name 


S1M10000029B08 

S1M10000029G08 

S1M10000029H08 

S1M10000029A09 

S1M10000029C09 

S1M10000029D09 

S1M10000029F09 

S1M10000029H09 

S1M10000029A10 

S1M10000029B10 

S1M10000029C10 

S1M10000029D10 

S1M10000029E10 

S1M10000029F10 

S1M10000029H10 

S1M10000029A11 

S1M10000029E11 

S1M10000029F11 

S1M10000029A12 

S1M10000029C12 

S1M10000029D12 

S1M10000029F12 

S1M10000029G12 

S1M10000030B01 

S1M10000030D01 

S1M10000030F01 

S1M10000030H01 

S1M10000030A02 

S1M10000030B02 

S1M10000030C02 

S1M10000030D02 

S1M10000030E02 

S1M10000030H02 

S1M10000030B03 

S1M10000030C03 

S1M10000030D03


SeqID






Clone Name 


S1M10000030G03 

S1M10000030H03

S1M10000030C04

S1M10000030A05

S1M10000030B05 

S1M10000030C05 

S1M10000030D05 

S1M10000030G05 

S1M10000030H05 

S1M10000030D06 

S1M10000030E06 

S1M10000030B07 

S1M10000030D07 

S1M10000030G07 

S1M10000030H07 

S1M10000030C08 

S1M10000030F08 

S1M10000030G08 

S1M10000030A09 

S1M10000030B09 

S1M10000030C09 

S1M10000030D09 

S1M10000030F09 

S1M10000030G09 

S1M10000030H09 

S1M10000030A10 

S1M10000030C10 

S1M10000030D10 

S1M10000030F10 

S1M10000030G10 

S1M10000030H10 

S1M10000030A11 

S1M10000030D11 

S1M10000030E11 

S1M10000030G11 

S1M10000030C12 


SeqID


Clone Name


S1M10000003G03 

S1M10000003A04 

S1M10000003E04 

S1M10000003G04 

S1M10000003A05 

S1M10000003F05 

S1M10000003A06 

S1M10000003B06 

S1M10000003C06 

S1M10000003D06 

S1M10000003F06

S1M10000003A07 

S1M10000003C07 

S1M10000003E07 

S1M10000003F07 

S1M10000003B08 

S1M10000003D08 

S1M10000003A09 

S1M10000003B09 

S1M10000003E09 

S1M10000003A10 

S1M10000003C10 

S1M10000003D10 

S1M10000003E10 

S1M10000003A11 

S1M10000003E11 

S1M10000003B12 

S1M10000003C12 

S1M10000003F12 

S1M10000004C01 

S1M10000004D01 

S1M10000004F01 

S1M10000004G01 

S1M10000004C02 

S1M10000004F02 

S1M10000004B03 


SeqID

















Clone Name


E3M10000030H10

E3M10000030A11 

E3M10000030B11 

E3M10000030H11 

E3M10000030B12 

E3M10000030C12 

E3M10000030D12 
[illegible entry]

E3M10000030G12 
E3M10000031C01 
E3M10000031A02 
E3M10000031B02
E3M10000031F02 
E3M10000031B03 
E3M10000031D03 
E3M10000031E03 
E3M10000031G03 
E3M10000031B04 
E3M10000031C04 
E3M10000031D04 
E3M10000031E04 
E3M10000031F04 
E3M10000031G04 
E3M10000031G05 
E3M10000031H05 
E3M10000031A06 
E3M10000031C06
E3M10000031G06
E3M10000031H06
E3M10000031A07 
E3M10000031E07 
E3M10000031F07 
E3M10000031G07 
E3M10000031H07
E3M10000031A08 
E3M10000031D08 


SeqID

[SeqID values and the Clone Name / SeqID columns that follow are illegible in the source scan.]

Clone Name


S1M10000034E07 
S1M10000034F07 
S1M10000034G07 
S1M10000034H07 
S1M10000034A08 
S1M10000034B08 
S1M10000034D08 
S1M10000034F08 
S1M10000034G08 
S1M10000034H08 
S1M10000034A09 
S1M10000034B09 
S1M10000034C09 
S1M10000034F09 
S1M10000034G09 
S1M10000034H09 
S1M10000034B10 
S1M10000034D10 
S1M10000034E10 
S1M10000034F10 
S1M10000034H10 
S1M10000034A11 
S1M10000034D11 
S1M10000034E11 
S1M10000034G11 
S1M10000034A12 
S1M10000034B12 
S1M10000034C12 
S1M10000034D12 
S1M10000034E12 
S1M10000034F12 
S1M10000034G12 
S1M10000035B01 
S1M10000035C01 
S1M10000035D01 
S1M10000035H01


SeqID

[SeqID values and the Clone Name / SeqID columns that follow are illegible in the source scan.]

Clone Name


S1M10000035C02 
S1M10000035E02 
S1M10000035G02 
S1M10000035A03 
S1M10000035B03 
S1M10000035E03 
S1M10000035F03 
S1M10000035B04 
S1M10000035C04 
S1M10000035D04
S1M10000035E04 
S1M10000035F04 
S1M10000035C06 
S1M10000035D06 
S1M10000035H07 
S1M10000035A08 
S1M10000035B08 
S1M10000035E08 
S1M10000035H08 
S1M10000035A09
S1M10000035D09 
S1M10000035E09 
S1M10000035F09 
S1M10000035G09 
S1M10000035H09 
S1M10000035A10 
S1M10000035H10 
S1M10000035A11 
S1M10000035B11 
S1M10000035C11 
S1M10000035G11 
S1M10000035H11
S1M10000035A12 
S1M10000035D12 
S1M10000035E12 
S1M10000035G12 


SeqID

[SeqID values and the Clone Name / SeqID columns that follow are illegible in the source scan.]

Clone Name


S1M10000038G08 
S1M10000038A09 
S1M10000038B09 
S1M10000038D09 
S1M10000038F09 
S1M10000038H09 
S1M10000038C10 
S1M10000038D10 
S1M10000038E10
S1M10000038F10 
S1M10000038G10 
S1M10000038A11 
S1M10000038C11
S1M10000038D11 
S1M10000038F11 
S1M10000038G11 
S1M10000038H11 
S1M10000038A12 
S1M10000038B12 
S1M10000038C12 
S1M10000038D12
S1M10000038E12 
S1M10000038F12 
S1M10000038G12 
S1M10000039B01 
S1M10000039E01 
S1M10000039A02 
S1M10000039B02 
S1M10000039D02 
S1M10000039F02 
S1M10000039H02 
S1M10000039F03 
S1M10000039G03 
S1M10000039H03 
S1M10000039C04 
S1M10000039G04 


SeqID

[SeqID values and the Clone Name / SeqID columns that follow are illegible in the source scan.]

Clone Name


S1M10000041E06 
S1M10000041G06 
S1M10000041B07 
S1M10000041D07 
S1M10000041H07 
S1M10000041C08 
S1M10000041D08 

S1M10000041H08 
S1M10000041E09 
S1M10000041H09 
S1M10000041C10 
S1M10000041D10 
S1M10000041G10 
S1M10000041C11 
S1M10000041F11 
S1M10000041G11 
S1M10000041B12 
S1M10000041D12 
S1M10000041E12 
S1M10000041F12 
S1M10000042E01 
S1M10000042F01 
S1M10000042G01 
S1M10000042B02 
S1M10000042C02 
S1M10000042F02 
S1M10000042B03 
S1M10000042E03 
S1M10000042G03 
S1M10000042A04 
S1M10000042D04 
S1M10000042A05 
S1M10000042C05
S1M10000042F05 
S1M10000042H05 


SeqID

[SeqID values and the Clone Name / SeqID columns that follow are illegible in the source scan.]

Clone Name


S1M10000042A06 
S1M10000042B06
S1M10000042C06 
S1M10000042E06 
S1M10000042F06 
S1M10000042A07 
S1M10000042B07 
S1M10000042D07 
S1M10000042H07 
S1M10000042B08 
S1M10000042E08 
S1M10000042F08 
S1M10000042G08 
S1M10000042A09 
S1M10000042B09 
S1M10000042F09 
S1M10000042G09 
S1M10000042B10 
S1M10000042C10 
S1M10000042D10 
S1M10000042F10 
S1M10000042G10 
S1M10000042A11 
S1M10000042B11 
S1M10000042C11 
S1M10000042D11 
S1M10000042F11 
S1M10000042H11 
S1M10000042A12 
S1M10000042B12 
S1M10000042G12 
S1M10000043B01 
S1M10000043D01 
S1M10000043F01 
S1M10000043G01 
S1M10000043H01 


SeqID

[SeqID values and the Clone Name / SeqID columns that follow are illegible in the source scan.]

Clone Name


S1M10000043H10 

S1M10000043A11 

S1M10000043C11 

S1M10000043E11 

S1M10000043H11 

S1M10000043A12 

S1M10000043B12 

S1M10000043C12 

S1M10000043D12 

S1M10000043E12 

S1M10000044B01 

S1M10000044D01 

S1M10000044E01 

S1M10000044A02 

S1M10000044B02 

S1M10000044E02 

S1M10000044F02 

S1M10000044G02 

S1M10000044C04 

S1M10000044D04 

S1M10000044B05 

S1M10000044G05 

S1M10000044H05 

S1M10000044A06 

S1M10000044B06 

S1M10000044C06 

S1M10000044D06 

S1M10000044E06 

S1M10000044F06 

S1M10000044H06 

S1M10000044C07 

S1M10000044E07 

S1M10000044H07 

S1M10000044A08 

S1M10000044B08 

S1M10000044C08 


SeqID

[SeqID values and the Clone Name / SeqID column that follows are illegible in the source scan.]

Clone Name


K1M10000022C10 
K1M10000030C03
K1M10000030C04 
K1M10000030C07 
K1M10000030E07 
K1M10000032E11 
K1M10000033E01 

[illegible entry]

K1M10000037D10 

K1M10000038D04 

K1M10000039A12 

K1M10000043E02 

K1M10000043D05 

K1M10000043H10 

K1M10000044G05 

K1M10000045D10 

P1M10000008G04 

P1M10000008C06 

P1M10000010C03 

P1M10000015C09 

P1M10000018B01 

P1M10000018C01 

P1M10000018E01 

P1M10000019F01 

P1M10000019E02 

P1M10000021G03 

P1M10000021G05 

P1M10000022D09

P1M10000024H03 

P1M10000024D06 

P1M10000024E06 

P1M10000025G07 

P1M10000025H07 

P1M10000026H02 

P1M10000026F04 

P1M10000026H05 


SeqID

[SeqID values and the Clone Name / SeqID columns that follow are illegible in the source scan.]

Clone Name


S1M10000044D08 

S1M10000044F08 

S1M10000044G08 

S1M10000044H08 

S1M10000044A09 

S1M10000044D09 

S1M10000044E09 

S1M10000044H09 

S1M10000044D10 

S1M10000044E10

S1M10000044F10 

S1M10000044G10 

S1M10000044H10 

S1M10000044A11 

S1M10000044B11 

S1M10000044C11 

S1M10000044D11 

S1M10000044E11 

S1M10000044G11 

S1M10000044H11 

S1M10000044A12 

S1M10000044B12 

S1M10000044C12 

S1M10000044D12 

S1M10000045B01 

S1M10000045D01 

S1M10000045A02 

S1M10000045B02 

S1M10000045C02 

S1M10000045B03 

S1M10000045C03 

S1M10000045D03 

S1M10000045G03 

S1M10000045C04 

S1M10000045E04 

S1M10000045F04 


SeqID

[Clone Name and SeqID entries illegible in the source]



Clone Name 


S1M10000047F03 

S1M10000047G03

S1M10000047H03

S1M10000047A04

S1M10000047B04

S1M10000047C04 

S1M10000047D04 

S1M10000047E04 

S1M10000047F04 

S1M10000047G04 

S1M10000047H04 

S1M10000047A05 

S1M10000047B05 

S1M10000047C05 

S1M10000047D05 

S1M10000047E05 

S1M10000047F05 

S1M10000047G05 

S1M10000047H05 

S1M10000047A06 

S1M10000047B06 

S1M10000047C06 

S1M10000047E06 

S1M10000047F06 

S1M10000047G06 

S1M10000047A07 

S1M10000047C07 

S1M10000047D07 

S1M10000047F07 

S1M10000047G07 

S1M10000047H07 

S1M10000047A08 

S1M10000047B08 

S1M10000047C08 

S1M10000047E08 

S1M10000047F08


SeqID

[Clone Name and SeqID entries illegible in the source]


Clone Name


S1M10000023G06 

S1M10000023H06 

S1M10000023B07 

S1M10000023D07 

S1M10000023E07 

S1M10000023F07 

S1M10000023G07 

S1M10000023H07 

S1M10000023B08 

S1M10000023D08 

S1M10000023F08 

S1M10000023G08 

S1M10000023A09 

S1M10000023B09 

S1M10000023D09 

S1M10000023G09 

S1M10000023H09 

S1M10000023B10 

S1M10000023C10 

S1M10000023D10 

S1M10000023E10 

S1M10000023F10 

S1M10000023H10 

S1M10000023A11 

S1M10000023B11 

S1M10000023C11 

S1M10000023E11 

S1M10000023F11 

S1M10000023G11 

S1M10000023A12 

S1M10000023B12 

S1M10000023C12 

S1M10000023D12 

S1M10000023F12 

S1M10000024D01 

S1M10000024A02 


SeqID

[Clone Name and SeqID entries illegible in the source]


Clone Name


E3M10000052G04 
E3M10000052C05 
E3M10000052D05 
E3M10000052F05 
E3M10000052G05 
E3M10000052G06 
E3M10000052H06 

E3M10000052F08 
E3M10000052E09 
E3M10000052G09 
E3M10000052F10 
E3M10000052D11 
E3M10000052D12 
1008-H20 
1011-P20 
1053-37 
1010-C11 
1017-H1 
1067-16 
1083-27 
1065-12 
221-41 
B 17-6.0 10 
910-B20 
B18-2.N21 
971-B20 
D1-1.A15 
4-28.1 
D1-2.B13 
D1-2.P21 
Z56-D2 
PJMF55 
R1-15.A13 
R1-19.H1 
R1-55.M2 


SeqID

[Clone Name and SeqID entries illegible in the source]


Clone Name


Z45-F11 
Z8-B9 
E1M10000007B04 
227-10 
709-F23 
801-C15 
801-H19 

807-D20 
B13-17.G8 
B5-6.C8 
B8-2.D9
B15-8.P13 
T13-5.A2 
T12-3.I11 
T20-15.D4 
T24-15.G6 
T24-17.C6 
244.B12 
1042-Jl 
195.F5 
25.D5 
25.D6 
177.F3 
525.H11 
632.N2 
63337 
671.00 
676.B12 
643.B19
720.016 
666.H12 
98.D4 
844.B21 
P31-25-F3 
P335-8.H8 


SeqID

[Clone Name and SeqID entries illegible in the source]


Clone Name


S1M10000024D11 
S1M10000024G12 
S1M10000025B01 
S1M10000025C01 
S1M10000025D01 
S1M10000025E01 
S1M10000025B02 
S1M10000025A03
S1M10000025B03 
S1M10000025C03 
S1M10000025D03 
S1M10000025F03 
S1M10000025D04 
S1M10000025E04 
S1M10000025G04 
S1M10000025B05 
S1M10000025C05 
S1M10000025F05 
S1M10000025H05 
S1M10000025B06 
S1M10000025D06 


SeqID

[SeqID entries illegible in the source]


Clone Name


P1M10000094H04 
P1M10000094A10 
P1M10000095C01 
P1M10000095E04 
P1M10000095G04 
P1M10000095C09 
P1M10000102E05 

P1M10000103B05 
P1M10000103D06 
P1M10000103E08 
P1M10000104A02 
P1M10000104H02 
P1M10000104A03 
P1M10000104E03 
P1M10000104F07 
P1M10000104D11 
P1M10000105D01 
P1M10000105E02 
P1M10000105C03 
P1M10000105G03 


SeqID

[Clone Name and SeqID entries illegible in the source]

EXAMPLE 3 

Comparison Of Isolated Nucleic Acids to Known Sequences
The nucleotide sequences of the subcloned fragments from Escherichia coli, Staphylococcus
aureus, Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa and Salmonella
typhimurium obtained from the expression vectors discussed above were compared to known
sequences from Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella
pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium and other microorganisms as
follows. First, to confirm that each clone originated from one location on the chromosome and was
not chimeric, the nucleotide sequences of the selected clones were compared against the
Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae,
Pseudomonas aeruginosa or Salmonella typhimurium genomic sequences to align the clone to the
correct position on the chromosome. The NCBI BLASTN v 2.0.9 program was used for this
comparison, and the incomplete Staphylococcus aureus genomic sequences licensed from TIGR, as
well as the NCBI nonredundant GenBank database, were used as the source of genomic data.
Salmonella typhimurium sequences were compared to sequences available from the Genome
Sequencing Center (http://genome.wustl.edu/gsc/salmonella.shtml), and the Sanger Centre
(http://www.sanger.ac.uk/projects/S_typhi). Pseudomonas aeruginosa sequences were compared to
a proprietary database and the NCBI GenBank database. The E. faecalis sequences were compared to
a proprietary database.

The BLASTN analysis was performed using the default parameters except that the filtering
was turned off. No further analysis was performed on inserts which resulted from the ligation of
multiple fragments.
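
The single-location check described above can be pictured with a short sketch. It is only an illustration under stated assumptions, not the procedure actually used: the `Hit` record, the 90% coverage threshold and the 50-base gap tolerance are hypothetical, and in practice the hits would come from parsed BLASTN output.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Hit:
    """One BLASTN alignment of a clone insert against a genome (hypothetical record)."""
    q_start: int  # 1-based start on the clone (query)
    q_end: int    # 1-based end on the clone
    s_start: int  # start on the genomic subject
    s_end: int    # end on the subject; s_end < s_start means minus strand


def is_single_locus(hits: List[Hit], clone_len: int,
                    min_coverage: float = 0.9, max_gap: int = 50) -> bool:
    """Return True if the hits place the clone at one contiguous genomic location."""
    if not hits:
        return False
    ordered = sorted(hits, key=lambda h: h.q_start)
    minus = [h.s_end < h.s_start for h in ordered]
    if any(minus) and not all(minus):
        return False  # mixed strands suggest ligated (chimeric) fragments
    covered = 0
    prev = None
    for h in ordered:
        if prev is not None:
            # The step along the genome should match the step along the clone.
            q_gap = h.q_start - prev.q_end
            s_gap = (prev.s_end - h.s_start) if minus[0] else (h.s_start - prev.s_end)
            if abs(s_gap - q_gap) > max_gap:
                return False
        covered += h.q_end - h.q_start + 1  # assumes hits do not overlap on the clone
        prev = h
    return covered / clone_len >= min_coverage


# A clone whose two halves map 49 kb apart is flagged as not single-locus.
print(is_single_locus([Hit(1, 100, 1000, 1099), Hit(101, 200, 50000, 50099)], 200))
```

A clone that fails this test would correspond to an insert "which resulted from the ligation of multiple fragments" and would be set aside.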

In general, antisense molecules and their complementary genes are identified as follows.
First, all possible full length open reading frames (ORFs) are extracted from available genomic
databases. Such databases include the GenBank nonredundant (nr) database, the unfinished
genome database available from TIGR and the PathoSeq database developed by Incyte Genomics.
The latter database comprises over 40 annotated bacterial genomes including complete ORF
analysis. If databases are incomplete with regard to the bacterial genome of interest, it is not
necessary to extract all ORFs in the genome but only to extract the ORFs within the portions of the
available genomic sequences which are complementary to the clones of interest. Computer
algorithms for identifying ORFs, such as GeneMark, are available and well known to those in the
art. Comparison of the clone DNA to the complementary ORF(s) allows determination of whether
the clone is a sense or antisense clone. Furthermore, each ORF extracted from the database can be
compared to sequences in well annotated databases including the GenBank (nr) protein database,
SWISSPROT and the like. A description of the gene or of a closely related gene in a closely related
microorganism is often available in these databases. Similar methods are used to identify antisense
clones corresponding to genes encoding non-translated RNAs.

In order to generate the gene identification data compiled in Table IB, each of the cloned
nucleic acid sequences discussed above corresponding to SEQ ID NOs. 1-6213 was used to identify
the corresponding Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella
pneumoniae, Pseudomonas aeruginosa or Salmonella typhimurium ORFs in the PathoSeq v.4.1
(March 2000 release) database of microbial genomic sequences. For this purpose, the NCBI
BLASTN 2.0.9 computer algorithm was used. The default parameters were used except that
filtering was turned off. The default parameters for the BLASTN and BLASTX analyses were:
Expectation value (e)=10
Alignment view options: pairwise
Filter query sequence (DUST with BLASTN, SEG with others)=T
Cost to open a gap (zero invokes behavior)=0
Cost to extend a gap (zero invokes behavior)=0
X dropoff value for gapped alignment (in bits) (zero invokes behavior)=0
Show GI's in deflines=F
Penalty for a nucleotide mismatch (BLASTN only)=-3
Reward for a nucleotide match (BLASTN only)=1
Number of one-line descriptions (V)=500
Number of alignments to show (B)=250
Threshold for extending hits=default
Perform gapped alignment (not available with BLASTX)=T
Query Genetic code to use=1
DB Genetic code (for TBLAST[nx] only)=1
Number of processors to use=1
SeqAlign file
Believe the query defline=F
Matrix=BLOSUM62
Word Size=default
Effective length of the database (use zero for the real size)=0
Number of best hits from a region to keep=100
Length of region used to judge hits=20
Effective length of the search space (use zero for the real size)=0
Query strands to search against database (for BLAST[nx] and TBLASTX), 3 is both, 1 is top, 2 is bottom=3
Produce HTML output=F
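
As an illustration of how the settings above map onto a command line, the sketch below assembles an invocation of the legacy NCBI `blastall` program. The flag letters are those of the legacy `blastall` tool; the database and file names are placeholders introduced here, and the filter flag is set to F because the text states that filtering was turned off. Only a subset of the listed parameters is passed explicitly; the rest are left at their defaults.

```python
import shlex
from typing import List


def blastall_command(query_fasta: str, database: str, out_path: str) -> List[str]:
    """Build a legacy `blastall` BLASTN command mirroring the listed settings."""
    return [
        "blastall",
        "-p", "blastn",     # program
        "-d", database,     # database name (placeholder)
        "-i", query_fasta,  # query file (placeholder)
        "-o", out_path,     # report file (placeholder)
        "-e", "10",         # expectation value
        "-m", "0",          # pairwise alignment view
        "-F", "F",          # filtering turned off, as stated in the text
        "-q", "-3",         # penalty for a nucleotide mismatch
        "-r", "1",          # reward for a nucleotide match
        "-v", "500",        # number of one-line descriptions
        "-b", "250",        # number of alignments to show
        "-g", "T",          # perform gapped alignment
        "-a", "1",          # number of processors
        "-S", "3",          # search both query strands
    ]


# Hypothetical file and database names, for illustration only.
cmd = blastall_command("clones.fasta", "pathoseq_nt", "clones_vs_pathoseq.txt")
print(shlex.join(cmd))
```

Passing the command as a list (for example to `subprocess.run`) avoids shell quoting problems with file names.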

Alternatively, ORFs were identified and refined by conducting a survey of the public and
private data sources. Full-length gene protein and nucleotide sequences for these organisms were
assembled from various sources. For Pseudomonas aeruginosa, gene sequences were adopted from
the Pseudomonas genome sequencing project (downloaded from http://www.pseudomonas.com).
For Klebsiella pneumoniae, Staphylococcus aureus, Streptococcus pneumoniae and Salmonella
typhi, genomic sequences from PathoSeq v 4.1 (Mar 2000 release) were reanalyzed for ORFs using
the gene finding software GeneMark v 2.4a, which was purchased from GenePro Inc., 451 Bishop
St., N.W., Suite B, Atlanta, GA, 30318, USA.

Antisense clones were identified as those clones for which transcription from the inducible
promoter would result in the expression of an RNA antisense to a complementary ORF, intergenic
or intragenic sequence. Those clones that contained single inserts and caused growth sensitivity
upon induction are listed in Table IA.
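
The sense/antisense determination can be sketched in a few lines. This is a simplified illustration, not the method used with GeneMark: the ORF finder below uses a naive definition (ATG to the first in-frame stop codon, with a minimum length), and the function names and the `transcribed_strand` argument are hypothetical.

```python
from typing import List, NamedTuple

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")
STOPS = {"TAA", "TAG", "TGA"}


def reverse_complement(seq: str) -> str:
    return seq.translate(COMPLEMENT)[::-1]


class Orf(NamedTuple):
    start: int   # 0-based start on the forward strand
    end: int     # exclusive end on the forward strand
    strand: str  # "+" or "-"


def find_orfs(genome: str, min_codons: int = 30) -> List[Orf]:
    """Naive ORF finder: ATG to the first in-frame stop, on both strands."""
    genome = genome.upper()
    n = len(genome)
    orfs = []
    for strand, seq in (("+", genome), ("-", reverse_complement(genome))):
        for frame in range(3):
            start = None
            for i in range(frame, n - 2, 3):
                codon = seq[i:i + 3]
                if start is None and codon == "ATG":
                    start = i
                elif start is not None and codon in STOPS:
                    if (i + 3 - start) // 3 >= min_codons:
                        if strand == "+":
                            orfs.append(Orf(start, i + 3, "+"))
                        else:
                            # Convert minus-strand offsets to forward coordinates.
                            orfs.append(Orf(n - (i + 3), n - start, "-"))
                    start = None
    return orfs


def classify_clone(insert_start: int, insert_end: int, transcribed_strand: str,
                   orfs: List[Orf]) -> List[str]:
    """Label the clone 'sense' or 'antisense' for each ORF it overlaps.

    transcribed_strand is the genomic strand whose sequence the RNA made
    from the inducible promoter carries.
    """
    labels = []
    for orf in orfs:
        if insert_start < orf.end and orf.start < insert_end:
            labels.append("sense" if transcribed_strand == orf.strand else "antisense")
    return labels


genome = "ATG" + "GCT" * 30 + "TAA" + "CCC"
orfs = find_orfs(genome)
print(classify_clone(10, 60, "-", orfs))
```

An insert transcribed from the strand opposite to an overlapping ORF is reported as antisense; an insert that overlaps no ORF returns an empty list, corresponding to the intergenic case.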

The gene descriptions in the PathoSeq database derive from annotations available in the
public sequence databases described above. Where a clone was found to share significant sequence
identity to two or more adjacent ORFs, it was listed once for each ORF and the PathoSeq
information for each ORF was compiled in Table IB.
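
The "listed once for each ORF" convention amounts to expanding each clone into one row per matched locus. A small sketch follows, using clone and locus identifiers that appear in Table IB; the `hits` dictionary is an assumption about how the BLASTN results might be held in memory.

```python
from typing import Dict, List, Tuple

# Clone name -> PathoSeq loci with significant identity (values taken from Table IB).
hits: Dict[str, List[str]] = {
    "E1M10000233A09": ["ECO104092", "ECO104093"],  # spans two adjacent ORFs
    "E3M10000001C02": ["EFA200840"],
}


def table_rows(clone_hits: Dict[str, List[str]]) -> List[Tuple[str, str]]:
    """Expand each clone into one (clone name, locus ID) row per matched ORF."""
    return [(clone, locus) for clone, loci in clone_hits.items() for locus in loci]


for clone, locus in table_rows(hits):
    print(f"{clone}\t{locus}")
```

This is why the same clone name can appear on consecutive rows of Table IB with different locus identifiers.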

Table IA lists the SEQ ID NOs. and clone names of the inserts which inhibited
proliferation. This information was used to identify the ORFs (SEQ ID NOs. 6214-42397) whose
gene products (SEQ ID NOs. 42398-78581) were inhibited by the nucleic acids comprising the
nucleotide sequences of SEQ ID NOs. 1-6213. Table IB lists the clone name and the PathoSeq
Locus containing the clone.

TABLE IB



Clone Name      Gene LocusID  Clone Name      Gene LocusID  Clone Name      Gene LocusID
[missing]       EFA205258     E1M10000233D08  ECO103185     S1M10000005D06  [missing]
E3M10000001B02  EFA205225     E1M10000233F08  ECO103265     S1M10000005A07  SAU800967
E3M10000001B02  EFA201977     E1M10000233F08  ECO103266     S1M10000005B07  SAU802496
E3M10000001B02  EFA203137     E1M10000233A09  ECO104092     S1M10000005D07  SAU801264
E3M10000001C02  EFA200840     E1M10000233A09  ECO104093     S1M10000005A08  SAU802496
E3M10000001D02  EFA202003     E1M10000233E09  ECO103238     S1M10000005B08  SAU800548
E3M10000001E02  EFA200840     E1M10000233E09  ECO103239     S1M10000005D08  SAU800607
E3M10000001F02  EFA200807     E1M10000233F09  ECO103886     S1M10000005E08  SAU802496
E3M10000001G02  EFA205257     E1M10000233D10  ECO103242     S1M10000005B09  SAU800122
E3M10000001G02  EFA205258     E1M10000233D10  ECO103243     S1M10000005C09  SAU801481
E3M10000001H02  EFA200811     E1M10000233H10  ECO100094     S1M10000005D09  SAU800542
E3M10000001E03  EFA201987     E1M10000234E01  ECO103884     S1M10000005A10  SAU801723
E3M10000001E03  EFA205258     E1M10000234B02  ECO103886     S1M10000005A10  SAU801722
E3M10000001G03  EFA201987     E1M10000234G02  ECO103233     S1M10000005A11  SAU801644
E3M10000001G03  EFA205258     E1M10000234G02  ECO103234     S1M10000005C11  SAU801113
E3M10000001H03  EFA201987     E1M10000234C05  ECO103181     S1M10000005D11  SAU800547
E3M10000001H03  EFA205258     E1M10000234C07  ECO103844     S1M10000005E11  SAU800155
E3M10000001D04  EFA201980     E1M10000234C08  ECO103878     S1M10000005B12  SAU802160
E3M10000001D04  EFA201981     E1M10000234C08  ECO204942     S1M10000005B12  SAU603460
E3M10000001D04  EFA205229     E1M10000234F08  ECO103461     S1M10000005D12  SAU801644
E3M10000001E04  EFA201028     E1M10000234H08  ECO103226     S1M10000006F01  SAU801264
E3M10000001F04  EFA200811     E1M10000234F09  ECO103055     S1M10000006B02  SAU800381
E3M10000001G04  EFA201993     E1M10000234D10  ECO100876     S1M10000006E02  SAU802496
E3M10000001H04  EFA201980     E1M10000234G10  ECO100886     S1M10000006F02  SAU802160
E3M10000001H04  EFA201981     E1M10000234B12  ECO104010     S1M10000006G02  SAU802125
[missing]       EFA205229     E1M10000235D01  ECO102233     S1M10000006A03  SAU802496
E3M10000001B05  EFA201993     E1M10000235A03  ECO100798     S1M10000006B03  SAU802655
E3M10000001D05  EFA201974     E1M10000235H03  ECO1038S6     S1M10000006D03  SAU801740
E3M10000001D05  EFA201975     E1M10000235E04  ECO103236     S1M10000006E03  SAU801256
E3M10000001G05  EFA202001     E1M10000235B06  ECO103886     S1M10000006F03  SAU801434
E3M10000001G05  EFA202003     E1M10000235F06  ECO1034S1     S1M10000006G03  SAU801275
E3M10000001A06  EFA201028     E1M10000235B08  ECO103885     S1M10000006A04  SAU80H39
E3M10000001F06  EFA201028     E1M10000235E08  ECO103161     S1M10000006B04  SAU802496
E3M10000001B08  EFA201028     E1M10000235B09  ECO101848     S1M10000006C04  SAU802158
E3M10000001E08  EFA200807     E1M10000235H09  ECO103481     S1M10000006E04  SAU801089
E3M10000001C09  [missing]     E1M10000235H09  ECO103482     S1M10000006F04  [missing]
E3M10000001D09  [missing]     E1M10000235B10  [missing]     S1M10000006G04  [missing]
E3M10000001D09  [missing]     [missing]       [missing]     [missing]       [missing]
E3M10000001E09  [missing]     E1M10000235F12  [missing]     [missing]       [missing]
[missing]       EFA202022     [missing]       [missing]     S1M10000006A07  SAU800967
E3M10000004D02  EFA202028     E1M10000236A04  ECO103186     S1M10000006B07  SAU801760


Sl x vT10000019D04 


SAU802154 


E3M10000035H03 


EFA202168 


"CI \>T1 AAAAT71 171 7 

ElIYL 1000027 lr 12 


PpAl A7^A1 


^1M10000019G04 


SAU802238 


E3M10000035A04 


EFA202012 


ElM10000272h02 


T7/^p\ 1 A/t7^7 

bOU 1U4ZJ / 


^1 M1 00000 19 AO 5 


SAU802160 


E3M10000035C04 


EFA200840 


E1M10000272G03 


"C/^Ti t A77<7 

bcu iujzoz 


pi mi nnnflo i pros 


SAU800545 


E3M10000035D04 


EFA200418 


ElM10000272u03 


ur*r\i AT 879 


<i i tji i nonooi qdos 

0 1 1V1 i \J\J\J\J\J x /wvJ 


SAU801275 


E3M10000035E04 


EFA200841 


r-> 1 x / 1 AAAmn^m 

E 1 M 1 0000272G03 


T2fr\l A A 0 A7 


ciui nnOOO 1 9F05 


SAU801731 


E3M10000035G04 


EFA201987 


T» 1 X X\ AAAAnO A A/ 

E lM 1 0000272 AU4 


POm A177 1 
DLU 1 UJZZ 1 


^1 Ml 00000 19H05 


SAU802229 


E3M10000035A05 


■ %«r^ a a a a A * A 

EFA200418 


nn/i AAA AT *7TCA C 

E1M1000U2 /zrUj 


ppAi A^Q1 A 


^1 Ml 00000 19H05 


SAU802228 


E3M10000035C05 


EFA200807 


ElMiOUUUZ/ZAUO 


CfAi A^77^ 


S1M10000019A06 


SAU801605 


E3M10000035D05 


* t~* a aai A a /^i 

EFA201999 


blMlUvJUUZ /ZJduO 




^^10000019006 


SAU802241 


E3M10000035E05 


EFA200326 


1 A. yf 1 AAAAOTIUTAT 

b 1 M 1 UUUUz / ZrLU / 




^1 Ml 000001 9C06 


SAU802240 


E3M10OOO035G05 


■pp a «Anoni 

EFA200807 


t? 1 X. K 1 AAAAT71DAQ 

b 1 M 1 UUUUZ / ZrSUo 




^1M10000019D06 


SAU802442 


E3M10OO0O35A06 


irr^ a A AAA 1 A 

EFA202012 


T? 1 X A 1 AAAAnOPAO 

E 1 M 1 0UUU2 /2uUo 


rpAi aai /:a 


^1 Ml 000001 9F06 

O 1 IVi l WUv v 171 V/VJ 


SAU801565 


E3M10000035C06 


EFA202001 


E I M 1 0000272D08 


bv-uiuzzi j 


Q1M1 000001 QA07 


SAU800492 


E3M10000035F06 


EFA202168 


nn/t AAAAnOHAO 

ElMlUUUUZ/ZOUo 


DLU l U'f Zh j 


<s 1 M 1 00000 1 9 A07 


SAU800493 


E3M10000035H06 


EFA201987 


t**» i x yf! AAAA1T1PAO 

ElM10000272CjrUo 


npni AA7A7 
bCU IU4Z4Z 


Q 1 M 1 00000 1 QR07 


SAU800537 


E3M10000035B07 


EFA202012 


pi \ /i AAAAAT1 OA O 

ElM1000U2721iUo 


bcuiuuzzj 


ciui AAOOO 1 0P07 


SA1 J801275 


E3M10000035C07 


■pP A lAAOAl 

EFA200807 


Tl 1 TV K 1 AAAAITlPAO 


caai AAAQ1 

bUvJiu^L/y i 


oiyi 00000 1 QD07 


SAU802654 


E3M100O0035E07 


EFA202378 


m 'N/TI AAAA'iTJfAO 

b 1 M 1 UUUUZ ilKJOy 




<s 1 Ml 00000 19E07 

O A IVI 1 \J\J\J\J\J I Ji->\} 1 


SAU800591 


E3M10000035F07 


EFA201993 


r?l x/1 AAAAOTIT^AO 

El Ml uuuuz /zijuy 


CpAl A1 /CO/1 


c i aai AA0001 QH07 


SAU801476 


E3M1OO0O035AO8 


"PP A *\f\t\CS A f\ 

EFA200840 


P1»/i AAAAOT1 AAA 

E 1 M 1 000U272 JJUy 


bcuiuiooj 


Q 1 M 1 00000 1 Q A OR 




E3M1 000003 5B08 


EFA200677 


T? 1 X Jf 1 AAAA^TjAAfl 

El MlUUUU272UUy 


bcuiuiooo 


Q 1 M 1 00000 1 QROR 


C ATTR00122 


E3M10000035C08 


T^T^ A A A 1 A A 

EFA201979 


p>1 x /I AAAAOHApAA 

ElM1000U272uUy 


UPA1 A1 1 OA 

bUU 1 U 1 1 54 


Q1 \/fl AA00A1QR0R 


<Z AT JR00171 


E3M10000035C08 


PP A ^rtl AOA 

EFA2019S0 


t?i »/1 AAAAnOUAA 

b 1 M 1 UUUUz /zriUy 


CAAi A1 77,4 


Q 1 Ml 000001 0P0R 


q ATIR01940 


E3M10000035E08 


V^T~* A A A ^ A A A 

EFA201999 


p i » / i AAAAT7TDA 1 

E1M10000273E01 


CPA 1 AAAQC 

bcu l uuuy $ 


o iMiuuuuuiyruo 


AT TROT SI 7 


E3M10000035F08 


¥~»T^ A AAAAAV* 

EFA202006 


E 1 M 1 0000273D02 


bLUlUj231 


CI A/f 1 AAAAA1 Q^JAR 


<3 ATTR07^S4 


ir* x /i A A A A A p" y"i f \ A 

E3M10000035G08 


EFA201028 


null aaaaiticai 

ElM10000273b03 


bUUlUJOCiO 


C 1 \A"\ AAAAA1 QHAR 
o 1 ivi 1 UUUUU li/rlUo 


QATIR00016 


■» r * a a a a a r* a A A 

E3M10000035A09 


pp a ^ a 1 nm 

EFA201987 


T7 1 X X 1 AAAA111 P\AC 

E 1 M 1 0000273D05 


bCUlUl JZ4 


o 1 \/f1 AA00A1 OA 00 


Q AT TOO! 1Q^ 


E3M10000035C09 


EFA200842 


T? 1 X jf 1 AAAAOT1 T7AC 

blMlUUUU2/ibUj 


npn 1 Ani a 

bcuiuj /iu 


Q 1 \A 1 AAOOO 1 QROQ 


CAT 1001 I"}! 
O/WJOVJl Ua 


r t a a a a a ^ r a A 

E3M10000035E09 


EFA205207 


T? 1 X A 1 AAAAnTJPAC 

E 1 M 1 0000273 CjU5 


p/~i/^\ 1 AIT/IO 

bUUlUiz4Z 


C 1 \A\ AAAAA1 QTIAQ 
olMl UUUUU 1 ynVJy 


CAT TR01 *H7 


E3M10000035F09 


pp a r» n 1 <*\ O /T 

EFA201986 


T"» t X X 1 AAAA1 m ^ A C 

E 1 M 1 000027 3 G05 


bCUlUi243 


Q 1 \ A 1 AAAAA 1 OF* AO 

o i m i uuuuu i yijuy 


oat TR07AA0 


T**i ^ V * * A A A A A m *■» A A 

E3M10000035F09 


pp a ^ A f 

EFA205255 


T~> i X / 1 AAAAT71 

El Ml 0000273 C07 


bLUlU32oi 


C 1 A A 1 AAAAA t QPAQ 

o lMiuuuuuiyruy 


C AT TROO^ 1 7 
oAUoUUjI / 


Y** *S "A. f -t A A A/\A^ p* A A 

E3M10000035G09 


EFA202217 


pi x/i AAAAniPAO 

E1M10000273C08 


ppp 1 AT OTA 

bC01U323U 


C 1 A >f 1 AAAAA 1 QPAQ 

biMiuuuuuiyruy 


q ATT707^71 
oAUZUZOZO 


E3M10000035G09 


T— • A AAAA*f 

EFA202216 


PI »/i AAAAO^ 1 ) OAO 

E1M10000273C08 


TTPPVI ATI! 1 

ECOl 03231 


O 1 K A 1 AAAAA 1 AAAA 

s i m i uuuuu l yuuy 


CAT TCAA^77 


E3M10000035G09 


PP A 1 f\ A C\ ^ /~ 

EFA104836 


pi x/i AAAAOT5 /~i -1 -i 

E1M10000273G1 1 


bCO 1 UU 1 o4 


O 1 A A 1 AAAAA 1 QD 1 A 

o iMiuuuuuiyoiu 


CAT TRA1 *\ 1 R 


E3M 1000003 5 H09 


■pp a r\ r\ c\ r\ r 

EFA200805 


T7 1 X >T 1 AAAAAT5 i t 

E1M10000273GH 


ppA1 AA1 OC 

ECO 100 185 


C 1 A A 1 AAAAA 1 Qf"l 1 A 

oiMiuuuuuiyuiu 


CAT TRA1 777 
bAUcSUl /Z / 


|™> a » #4 A A AAAA ^TS 1 A 

E3M10000035B10 


pr » a r»rto 

EFA201888 


rm/l AAAAnlTTl 1 

E1M10000273H11 


p 1 at i o c 

ECO 103 185 


C 1 A A 1 AAAAA 1 OA 1 A 

olMlUUUUUiyLrlU 


CAT TRA1 "77A 
oAUoUl /ZO 


T** A ^'flAAAAAA^' TfV 1 A 

E3M1O00OO35D1O 


Y - ^* T> a AAAA*1 A 

: EFA202012 


ri i/i AAAAni nil 

E1M10000273B12 


ECO 100869 


ClA/flAAAAAIfl A 1 1 

o IMlUUUUUiyAl 1 


CAT TPA'? 1 </l 

oAUoUZi 04 


p n / 1 rv/\ aa ai c r 1 A 

E3M1O000035E1O 


EFA202298 


Pi Xii AAAAOITP 1 O 

blMlUUUU27iul2 


ppp t r\/\ AQA 

bCUlU4U>U 


C 1 A/fl AAAAA 1 OR1 1 

o iMiuuuuuiyjD 1 1 


CAT TR0O7R6 
oAUoUUZoO 


E3M10000035F10 


T7T~> A 1A1 0^*7 

EFA201867 


r?1 X/f 1 AAAA07/IAA 1 

blM10000274C01 


bCUlU323i 


C 1 \A 1 AAAAA 1 OP 1 1 

o lMiuuuuuiy^/i i 


CAT TROA^7^ 


T7 1 A if 1 AAAAm rpi n 


nr a taao/IA 

br A20Uo4U 


T7 1 \A 1 AAAA07/I /^A 1 

b 1 M 1 \j\)\)\jZ /40U 1 


T7pp i A*307/! 

bv_/UiU3Zj4 


C 1 A/fl AAAAA 1QP1 1 

o im juuuuuiyr 1 1 


C AT TR01 710 
o/\UOUl / iy 


b3MlUUUUU35Al 1 


ct: A 1 A 1 coo 

br AZUlooo 


b 1 M 1 UUUUZ /4JC1U 1 


p/^m A1AA1 


Q1 \A 1 00000 1 QCll 1 
o 1 1YI 1 UUUVJU l y\j 1 1 


CAT TR0777Q 
OUZZZ7 


f^miooooo^sri i 


FFA202012 


E1M10000274G01 


ECO 101259 


S1M10000019A12 


SAUS01680 


E3M10000035C11 


EFA202006 


E1M10000274G01 


ECO101258 


S1M10000019A12 


SAU801679 


E3M10000035D11 


EFA202378 


E1M10000274H05 


ECO 103262 


S1M10000019B12 


SAU802238 


E3M1OOO0035E11 


EFA200807 


E1M10000274H05 


ECO 103878 


S1M10000019C12 


SAU801193 


E3M10000O35F11 


EFA202006 


E1M10000274H05 


ECO204942 


S1M10000019D12 


SAU802226 


E3M10000035G11 


EFA200418 


E1M10000274D06 


ECO102556 


S1M10000020E01 


SAU801186 
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Clone Name 


Gene 
LocusID 


Clone Name 


Gene
LocusID




Gene 
LocusID 


E3M10000035H11


EFA200239 


nix / 1 AAA AO O A T7A/C 

blMlUUUUZ /4rU0 




S1M10000020F01


" SAU800391 


E3M10000035H11


EFA200240 


E1M10000274F06




S1M10000020G01


SAU800195 


E3M10000035B12


EFA200840 


E1M10000274F07




S1M10000020H01


SAU802686 


E3M10000035C12 


EFA202006 


" 1 X if 1 AAA f\*y*l A T?AO 

E1M10000274F07 




c i \/f i nnooo20R02 


SAU801303


E3M10000035E12 


EFA202006 


E1M10000274A08




S1M10000020H02


SAU800543 


E3M10000035F12


EFA201878 


E1M10000274D08




ciyi Ofl0OO90R0^ 


SAU800089 


E3M10000036B01


EFA201869 


E1M10000274F08 


EAA1 A1 


ciui 000009 on 0^ 


SAU800542 


E3M10000036B01


EFA201867 


E1M10000274F09 




S1M10000020F03


SAU802245 


E3M10000036C01


EFA201982 


E1M10000274D10 


T7/->/^ 1 AT OOA 


S1M10000020D04


SAU800324 


E3M10000036E01


EFA201869 


E1M10000274D10 




S1M10000020D04


SAU800325 


E3M10000036G01


EFA201977 


E1M10000274D11 




S1M10000020F04


SAU802226 


E3M10000036G01


EFA203137 


E1M10000274D12 


npn 1 AO COT. 


S1M10000020F04


SAU802225 


E3M10000036G02


EFA201878 


E1M10000274G12 


T70/^ i Al O O/l 


S1M10000020H04


SAU802240


E3M10000036G02


EFA201869 


E1M10000275C01 




Q1\^1 OOOOO90AO*; 
0 1 IYI 1 \AJVVV£\JrL\J J 


SAU801263


E3M10000036H02


EFA200840 


E1M10000275E01 


T7/^iy-\ i AOOO/C 


C 1 1V/1 0000090UO^ 


SAU801279


E3M10000036A03


EFA202217 


E1M10000275E01 


T7^/^1 AT O OO 


S1M10000020F05


SAU802249 


E3M10000036B03 


EFA200192 


E1M10000275B02




S1M10000020G05


SAU801011 


E3M10000036C03 


EFA202012 


E1M10000275B02




1000009 OA 06 


SAU802230 


E3M10000036D03 


EFA202006 


E1M10000275A03


CpAl A1 104 


Q 1 K/T1 0000090R06 

O IIVll\JV/vUV/^vJJvU 


SAU801491 


E3M10000036F03 


EFA201987 


"Cl Xif 1 AAAAOOCQAO 

blMlUOUUZ/DDUo 


Jlv^U 1 VZOj / 


S1M10000020D06


SAU800367 


E3M10000036G03


EFA200457 


TJ 1 X A 1 AAA AO "7 C Art? 

c 1 M 1 00U02/3UU.5 


CAA1 AO 1 C\ 


S1M10000020F06


SAU800232 


E3M10000036H03 


EFA202012 


E1M10000275D04


T7/^»/^ | ao on 


q i \/ri AAAnO90FOfi 


c ATT»01 n OO 


E3M10000036A04


EFA201982 


E1M10000275H04


tJO^ 1 AO QHQ 


c i \/r i AAO0090F0^ 

o I ivi i uuuuuzur uo 


QAT T801899 


E3M10000036D04 


EFA200538 


E1M10000275H04 


T7/~»/^ O A /! A /f 0 


C! 1 \ fi 1 rjAAAOOAWO/ 1 ! 
olMl UU vUUZUtlUO 


<sATTS01 1 79 


E3M10000036E04 


EFA201999 


E1M10000275C05 


bCOlUiooo 


o I IVl 1 UUUUUZUrlUO 


<1ATTP.01 1 R0 

OU 1 1 OU 


E3M10000036F04 


EFA200192 


E1M10000275E05 


T7/-^/^\ I AOOO 1 


c i l^/ri nnnnA9A a 07 

0 1 IVl 1 UUUUUZUAv / 


<IAnR00o"0fi 


E3M10000036G04 


EFA202007 


E1M10000275A06 


TJ/^/'M A1 OOA 

bCUlUl /oU 


0 1 TV>r 1 AAA fWY5 A A CY7 


<?ATTROOS0S 


E3M10000036H04 


EFA205288 


E1M10000275C06


bUUlUiooO 


Q1X/T1 AOA009AR07 

o 1 JVl I uuuuuzuou / 


q ATTR00001 


E3M10000036A05 


EFA200677 


E1M10000275G06


TTpAl AO/^O/t 


C 1 \A 1 AAA AA9A"n07 


OATIR011R5 


E3M10000036E05 


EFA201984 


E1M10000275G06


T7r^r^\ i ao/Co< 


C 1 \>f 1 AAAAA7 ATVA7 
b 1 JVl 1 UUUUUZUL/U / 


<^ATIR01 1 Rfi 
OAUOvI iou 


E3M10000036F05 


EFA200310 


E1M10000275A08 


ECO103298


C 1 A 4 1 AAAAA0AUA7 


Q AT IR01 1 70 
OnUOUl 1 / U 


E3M10000036H05 


EFA201976 


E1M10000275D08 


ECO101232


C 1 A A 1 AAA AAO A/^l A7 


o AT T80101 1 


E3M10000036A06 


EFA200418 


E1M10000275D08 


ECO101233


S1M10000020A08


Q AT 1^00^89 


E3M10000036B06 


EFA201999 


E1M10000275F09


ECO103221 


S1M10000020D08


CAT TQAOO/1Q 


E3M10000036C06 


EFA202007 


E1M10000275D10 


ECO103414 


S1M10000020E08


CAT TQAAC/IC 


E3M10000036D06 


EFA201978 


E1M10000275G11


ECO103886 


S1M10000020G08


C A T TQ A1 QAA 


E3M10000036G06 


EFA201886 


E1M10000275B12 


ECO103262


S1M10000020H08


C A T TC A1 

bAUoUl /4i 


E3M10000036H06 


EFA201999 


E1M10000275B12 


ECO103878


C 1 1 X A 1 AA AAAOADAO 

S 1 M 1 UUUUUZUi>l)y 


PAT TfiOOl 


E3M10000036A07 


EFA202015 


E1M10000275B12 


ECO204942 


olMl uuuuuzuuiiy 


c AT TR01 1 8^ 
oAUoUI 10J 


E3M10000036B07 


EFA200839 


E1M10000275C12


bC0101oo5 


O 1 \A 1 AAAAAOAT7AO 


QATTR0101 1 


E3M10000036B07 


EFA200840 


E1M10000275G12


bC01U3Z2U 


C 1 \A 1 AAAAAOAr^AQ 


c at TR01 000 


E3M10000036C07 


EFA200841 


E1M10000276E01


bCO 103202 


C 1 \A 1 AAAAA0AP1 H 


C AT TOAOOOO 


E3M10000036E07 


EFA202298 


E1M10000276E01


ECO103878


CI \A 1 A AAA AO Ar 1 1 A 


cat IQ0991 1 


E3M10000036H07


EFA200805 


E1M10000276E01


ECO204942 


C 1 \A 1 AA AH AO Am A 


c ATTR0999A 
oAUoUZZZ*f 


E3M10000036A08 


EFA200839 


E1M10000276F01


ECO101703


C 1 \A 1 AAAAAOAi^JI (] 

O 1 M 1 UUUUUZUu 1 U 


cat Tft0999^ 


E3M10000036A08 


EFA200840 


E1M10000276A03 


T7/~i/-\ 1 AA AA C 

bCOlUOuyj 


C 1 \A 1 AAAAAOAtJ 1 C 

o 1M lUUUUUzUrl 1U 


q.a t Tftonc/i^ 


E3M10000036B08 


EFA201888 


E1M10000276B03


T7/^r\ i m i on 
bCO103 loo 


C 1 \A 1 AAAAAOA A 1 1 


c AT TR0000*i 


v i\a i aaaaat fVR ar 

133 M 1UUUUU30.DUO 


pi? A901 RRfi 
1_/P/V^.U 1 OOP 


E1M10000276A04


ECO100549


S1M10000020C11 


SAU802507 


E3M10000036C08 


EFA201888 


E1M10000276D04 


ECO101259 


S1M10000020E11 


SAU800467 


E3M10000036E08 


EFA201987 


E1M10000276G04 


ECO100094 


S1M10000020F11 


SAU800160 


E3M10000036F08 


EFA202007 


E1M10000276G04 


ECO100095 


S1M10000020F11 


SAU800161 


E3M10000036H08 


EFA201883 


E1M10000276H04 


ECO104213 


S1M10000020G11 


SAU800391 


E3M10000036A09 


EFA201993 


E1M10000312C01 


ECO103223 


S1M10000020H11 


SAU802047 
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Clone Name 


Gene
LocusID


Clone Name


Gene
LocusID


Clone Name 


Gene 
LocusID 


T7"j.>.yfi AnriAAi/CPOQ 
b 3 M 1 UUUUU 3 OD\Jy 


PPA9A901 3 


E1M10000312B02


ECO103229 


S1M10000020A12 


SAU800548 


CTKA 1 AAAAAQ/^fAQ 

Ji J M i UUUUU 3 0 


c>r /vzuuH i o 


E1M10000312B02 


ECO103230


S1M10000020B12 


SAU800218 


T21\A 1 AAAAA1£TiAQ 

E. 3M 1 UUUUU J OJJ uy 


pp A 9A9A1 9 


E1M10000312D02


ECO100025


S1M10000020D12 


SAU801186 


Cl\jf 1 AAAAAI £FAQ 

fcJ M 1 UUUUU3 or \)y 


pp A 9A9974 


E1M10000312D02


ECO100026


S1M10000020E12 


SAU801185 


r?i\ji A A AAA"! ATJAO 

h 3 M 1 UUUUU 3 oriuy 




E1M10000312F03


ECO103886


S1M10000020F12 


SAU801493 


T?*J AvT1 AAAAAQ/C A 1 A 

hi M 1 UUUUU 3 0 A 1 U 


PPA901QR7 


E1M10000312A04


ECO103242


S1M10000020G12 


SAU801631 


"CITV/f 1 AAAAA^Afl fi 

E, jM 1 UUUUU JOL/ 1 U 


PP A 90041 R 


E1M10000312A04


ECO103243


S1M10000021D01 


SAU802468 


bjMlUUUUUJOU 1U 


PPA901 R^O 


E1M10000312D04


ECO103230


S1M10000021E01 


SAU801898 
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S1M10000030D09 


SAU801139 


709-F23 


ECO101506 


E1M10000295G01 


ECO103532 


S1M10000030F09


SAU801904 


801-C15 


ECO100488 


E1M10000295G01 


ECO103533 


S1M10000030G09 


SAU800542 


801-C15 


ECO100490


E1M10000295B02 


ECO101635 


S1M10000030H09 


SAU801644


801-C15 


ECO100491


E1M10000295E02 


ECO103217


S1M10000030A10 


SAU802309 


801-H19 


ECO100488


E1M10000295E02 


ECO103218 


S1M10000030A10 


SAU802308 


801-H19 


ECO100490


E1M10000295F02 


ECO100169 


S1M10000030C10 


SAU800562 
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Clone Name 


Gene 
LocusID 


Clone Name 


Gene 
LocusID 


Clone Name 


Gene 
LocusID


801-H19 


ECO100491


E1M10000295H04 


ECO100663


o 1 x 4 1 aaaaao (\t\ 1 a 


c attpataai 


804-P6 


ECO102513


E1M10000295A07


ECO100712


o 1 \ai aaaaa^api a 
olMIUUUUUJUrlU 


C ATTRAAA1 Q 


807-D20 


ECO100366


E1M10000295B07 


ECO100179


nii/i a aa c\(\ir\n 1 a 
S 1 M 1 UUUUUJtAJ 1 U 


C AT TQAAA1Q 


807-D20 


ECO100367


E1M10000295B07 


ECO100180


C* i\A1 AAAAAO. ALT 1 A 

S 1 M 1 UUUOU 3vH 1 U 


OAT TOAO<Ci. 


B13-17.G8 


ECO101111


E1M10000295C07 


ECO103224


O 1 Xif 1 AAAAA1A A 1 1 

S1M1UUUUUJUA1 1 


C AT TQAA<1 "7 


B5-6.C8 


ECO101475


E1M10000295C07 


ECO103225


o i % A 1 AAAAA1 A A 1 1 

SIMIUUUUUJUAI 1 


C AT TO AT £07 


B5-6.C8 


ECO101476


E1M10000295C07 


ECO103226


o i » a i aa a a ai at\ i i 

S1M10000U3UU11 


C ATTQAA<I1 H 

bAUoUUOl / 


B5-6.C8 


ECO201962 


E1M10000295D08 


ECO103225


nil/1 AAAAA1 f\T\ 1 1 

S1M10Q0G03UD1 1 


CAT TOAOyCOl 


B8-2.D9 


ECO103461 


E1M10000295D08 


ECO103226


S1M10000030E11 


SAU802241


B15-8.P13 


ECO101328 


E1M10000295F08 


ECO103160


S1M10000030G11 


SAU800811


B15-8.P13 


ECO101329 


E1M10000295G08 


ECO103217 


S1M10000030C12 


SAU801647


T13-5.A2 


ECO103059


E1M10000295G08 


ECO103218 


S1M10000030C12 


SAU801646


T12-3.il 1 


ECO102857 


E1M10000295B09 


ECO103236 


S1M10000030E12 


SAU800537


T20-15.D4 


ECO101475 


E1M10000295F09 


ECO103881 


S1M10000030G12 


OAT TOA1 CO/C 

SAUoOIj/o 


T20-15.D4 


ECO101476 


E1M10000295F09 


ECO103882 


S1M10000031B01 


SAU802240


T20-15.D4 


ECO201962 


E1M10000295G09 


ECO103263


S1M10000031H01 


SAU800023


T24-15.G6 


ECO103059 


E1M10000295D10 


ECO103101 


S1M10000031B02


SAU802247


T24-17.C6 


ECO102857 


E1M10000295H10 


ECO103263 


S1M10000031E02 


SAU801912


244.B12 


ECO101763 


E1M10000295B11 


ECO103229


S1M10000031F02


SAU802231


244.B12 


ECO101764


E1M10000295F11 


ECO100954


S1M10000031F02 


SAU802230 


244.B12 


ECO101765 


E1M10000295G12 


ECO103494 


S1M10000031G02 


SAU802235 


1042-J1 


ECO100702


E1M10000312D11 


ECO104091


S1M10000031G02 


SAU802234


1042-J1


ECO100703 


E1M10000312D11 


ECO104092


S1M10000031H02


SAU801355 


195.F5 


ECO102842


E1M10000296B01 


ECO102304 


S1M10000031A03 


SAU802250 


25.D5 


ECO103059 


E1M10000296C02 


ECO102466


S1M10000031E03 


SAU801134


25.D6 


ECO103059


E1M10000296C02 


ECO102467


S1M10000031E03


SAU801135 


177.F3 


ECO102309 


E1M10000296D02 


ECO103235


S1M10000031F03 


SAU802240 


525.H11 


ECO102857


E1M10000296D02 


ECO103236


S1M10000031G03 


SAU801505 


632.N2 


ECO104277


E1M10000296D02 


ECO103237 


S1M10000031A04


SAU801434


633.B7 


ECO103479


E1M10000296H02 


ECO102556


S1M10000031A04 


SAU302892 


671.120 


ECO103478


E1M10000296C03 


ECO100150


S1M10000031B04 


SAU800543


676.B12 


ECO103479 


E1M10000296C03 


ECO100151 


S1M10000031C04 


SAU800738 


643.B19 


ECO100702 


E1M10000296E03 


ECO101086 


S1M10000031C04 


SAU800737 


720.016 


ECO103884 


E1M10000296H03 


ECO103227


S1M10000031E04 


SAU800542 


666.H12 


ECO103478 


E1M10000296H03 


ECO103228 


S1M10000031F04 


SAU801517 


666.H12 


ECO103479


E1M10000296D04 


ECO103237


S1M10000031F04 


SAU801516


98.D4 


ECO103263


E1M10000296G04 


ECO102144 


S1M10000031G04 


SAU302611 


844.B21 


ECO102144 


E1M10000296F05 


ECO103886


S1M10000031G04 


SAU302882 


P31-25-F3 


ECO101686 


E1M10000296G05 


ECO101467


S1M10000031F05 


SAU800548 


P335-8.H8 


ECO101041 


E1M10000296H05 


ECO103094


S1M10000031D06 


SAU801526 


P347.2 


ECO101086 


E1M10000296A06 


ECO100194 


S1M10000031G06 


SAU800548


P31-11-J20 


ECO103228 


E1M10000296A06 


ECO100195 


S1M10000031H06 


SAU600582 


P336-14.F20 


ECO101370


E1M10000296G07 


ECO102827


S1M10000031C07 


SAU801760 


P31-27-M1 


ECO103423


E1M10000296G07 


ECO102828


S1M10000031D07


SAU801181 


P338-4.M21 


ECO100139 


E1M10000296H07 


ECO103220


S1M10000031E07


SAU800016 


P334-8.L7 


ECO101256 


E1M10000296H07 


ECO103221


S1M10000031A08 


SAU802365


P31-2-E16 


ECO101686


E1M10000296E08 


ECO100886


S1M10000031D08 


SAU801790


T>T1 C 1 T1 A 




E1M10000296F08


ECO103218


S1M10000031E08


SAU800547


P334-5.H2 


ECO100139


E1M10000296G08 


ECO103734 


S1M10000031F08 


SAU801264


P31-33-N2 


ECO103241


E1M10000296H08 


ECO100809 


S1M10000031C09 


SAU801193 


P332-11.C20 


ECO102827 


E1M10000296H08 


ECO100810


S1M10000031D09 


SAU800019 


P332-11.C20 


ECO102828


E1M10000296A09 


ECO100194 


S1M10000031G09 


SAU800006


869.A23 


ECO100390 


E1M10000296B11 


ECO103229


S1M10000031H09 


SAU801599 
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Gene 
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Clone Name 


Gene 
LocusID 


Clone Name 


Gene 
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P317-2.A3 


ECO101932 


E1M10000296B11 


ECO103230


S1M10000031A10


" CATTQfWY779 
jAUoUU/ IL 


P326-9.K2 


ECO103292


E1M10000296E11 


ECO101684 


SlMlwUUUil^lU 




P326-9.K2


ECO103293


E1M10000296F12 


ECO101684 


S1M10000031E10


cat TS2nnn A i 


P323-8.P1 


ECO101685 


E1M10000296G12 


ECO100095


S1M10000031F10




P35-8 


ECO103692


E1M10000298C01 


ECO101438 


" n a * K 1 AAAAA1 1P1 A 

S1M10000031G10 


CAT TSflflQ AO 


P36-13.E2 


ECO103059 


E1M10000298G01 


ECO104148


S1M10000031A11


Q ATTOA1 HA*\ 
oAUoUI /Hi 


P38-1.G20 


ECO102227


" E1M10000298G01 


ECO104149 


S1M10000031B11 


CAT TQA1 Of\C 


P327-50.M10 


ECO103242


E1M10000298G02 


ECO102636 


S1M10000031C11 


OAT TO AO 1 <0 


P327-50.M10 


ECO103243


E1M10000298C03 


ECO103238


S1M10000031F11


SAU800312


P328-8.D21 


ECO103240


E1M10000298C03 


ECO103239


S1M10000031G11 


SAU801234


P328-8.D21 


ECO103241 


E1M10000298D03 


ECO103886 


S1M10000031H11


SAU800962


P328-20.P20 


ECO100541 


E1M10000298H03 


ECO103262 


S1M10000031B12 


SAU801621


P33-1.C22 


ECO103227


E1M10000298H03 


ECO103878


S1M10000031C12 


SAU801741


X3S107-17 


ECO101475 


E1M10000298H03 


ECO204942 


S1M10000031E12 


SAU801275


X3S107-17 


ECO101476 


E1M10000298E04 


ECO100430


S1M10000031F12 


SAU800244 


X3S107-17 


ECO201962 


E1M10000298E04 


ECO100431 


S1M10000032B01 


SAU802654 


P35-7 


ECO103928 


E1M10000298H04 


ECO100809


S1M10000032C01 


SAU800548 


X3S118-9 


ECO103263 


E1M10000298H04 


ECO100808


S1M10000032F01 


SAU800525


X3S163-1 


ECO103423


E1M10000298C05 


ECO103234 


S1M10000032F01 


SAU800524 


X3S204-7 


ECO103238


E1M10000298C05 


ECO103235


S1M10000032H01 


SAU802112


X3S177-4 


ECO101161 


E1M10000298C05 


ECO103236 


S1M10000032H01 


SAU802111 


P342-3 


ECO102104 


E1M10000298D05 


ECO101539


S1M10000032E02 


SAU801096 


SC21.1 


ECO103224 


E1M10000298D05 


ECO101540 


S1M10000032G02 


SAU800830


SC17.1


ECO102087 


E1M10000298C06 


ECO101844 


S1M10000032G02 


SAU800829


SC13.1 


ECO101347 


E1M10000298D06 


ECO103886 


S1M10000032A03 


SAU802686 


SC13.1 


ECO101348 


E1M10000298G06 


ECO100096 


S1M10000032C03 


SAU800771


MC9.6 


ECO102929 


E1M10000298B07 


ECO100095


S1M10000032D03 


SAU801235


MC9.6 


ECO102928


E1M10000298C07


ECO102638


S1M10000032E03


SAU802240


Z60-P16 


ECO103243


E1M10000298G07 


ECO103233 


S1M10000032G03 


SAU801269


Z86-I21 


ECO103884


E1M10000298G07 


ECO103234 


S1M10000032C04 


SAU800771 


E1M10000109A02 


ECO104168


E1M10000298B09 


ECO103244 


S1M10000032E04 


SAU801526 


E1M10000109A11 


ECO102588


E1M10000298B09 


ECO103245 


S1M10000032F04 


SAU801139 


E1M10000101F05 


ECO102255 


E1M10000298B09 


ECO103246 


S1M10000032G04 


SAU801703 


E1M10000101D06 


ECO102556 


E1M10000298D09 


ECO101995 


S1M10000032H04 


SAU801263 


E1M10000101D06 


ECO102557 


E1M10000298D11 


ECO103001 


S1M10000032A05 


SAU801256 


E1M10000101A07 


ECO100500


E1M10000298F11 


ECO102462 


S1M10000032B05 


SAU800255 


E1M10000101A07 


ECO100501 


E1M10000298F11 


ECO102463 


S1M10000032B05 


SAU800256 


E1M10000101H07 


ECO104010


E1M10000311F01 


ECO101259 


S1M10000032C05 


SAU800453 


E1M10000101H09 


ECO103230 


E1M10000311F01 


ECO101258 


S1M10000032F05 


SAU802076 


E1M10000101H09 


ECO103231 


E1M10000311C02 


ECO103226 


S1M10000032H05 


SAU103752


E1M10000101C12 


ECO103217 


E1M10000311E02 


ECO103097 


S1M10000032A06 


SAU801738 


E1M10000103B04 


ECO101324 


E1M10000311E02 


ECO103098 


S1M10000032D06 


SAU801900


E1M10000103D11 


ECO103217


E1M10000311A03


ECO103100 


S1M10000032E06 


SAU801181


E1M10000110G01


ECO103242


E1M10000311C03


ECO103240


S1M10000032G06 


SAU800344 


E1M10000110G01 


ECO103243


E1M10000311C03 


ECO103241 


S1M10000032A07


SAU801760


E1M10000110H01 


ECO103263


E1M10000311D03


ECO103528 


S1M10000032B07 


SAU800546


E1M10000110E09


ECO103736


E1M10000311H03


ECO102868 


S1M10000032C07 


SAU802555 


E1M10000110A12 


ECO102035


E1M10000311D04 


ECO102636


S1M10000032D07 


SAU801511


E1M10000112F05 


ECO104157 


E1M10000311E05 


ECO100255 


S1M10000032F07 


SAU802654 


E1M10000113F02 


ECO101796 


E1M10000311F05 


ECO102309


S1M10000032H07 


SAU802234


E1M10000113A11 


ECO104237


E1M10000311D06 


ECO103217


S1M10000032H07 


SAU802233


E1M10000111C03 


ECO103238


E1M10000311C07 


ECO103263 


S1M10000032A08 


SAU800217 


E1M10000111C03 


ECO103239 


E1M10000311E07 


ECO103885 


S1M10000032A08 


SAU800218 
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v^ionc iName 


Gene 
LocusID 


Clone Name 


Gene 
LocusID 


Clone Name 


Gene 
LocusID


E1M10000111E04 


ECO103613 


E1M10000311F07


ECO103243 


S1M10000032B08 


SAU801523 


E1M10000111F09 


ECO103624


E1M10000311D08 


ECO103886 


S1M10000032E08 


SAU802161 


E1M10000111F09


ECO103625


E1M10000311C09 


ECO103218 


S1M10000032G08 


SAU800542 


E1M10000115H01


ECO103777


E1M10000311F09 


ECO101995 


S1M10000032B09 


SAU800434 


E1M10000115G02


ECO103218 


E1M10000311F09 


ECO101996 


S1M10000032C09 


SAU800548 


E1M10000115E03


ECO103163 


E1M10000311C10 


ECO101259 


S1M10000032D09 


SAU801626 


E1M10000115G04


ECO302213 


E1M10000311E11 


ECO102636 


S1M10000032D09 


SAU801625 


E1M10000115C06


ECO100184 


E1M10000311B12 


ECO100632 


S1M10000032D09 


SAU801624 


E1M10000116B01


ECO101086 


E1M10000311F12 


ECO103115 


S1M10000032E09


SAU801475 


E1M10000106D02


ECO103234


E1M10000292C05 


ECO104183 


S1M10000032H09 


SAU800548 


E1M10000106D02


ECO103235


E1M10000292D08 


ECO103220


S1M10000032A10 


SAU801089 


E1M10000106G02


ECO103243


E1M10000292D08 


ECO103221 


S1M10000032B10 


SAU801644 


E1M10000106E04


ECO103263 


E1M10000292A09 


ECO101834 


S1M10000032C10 


SAU800155 


E1M10000106F05


ECO103624 


E1M10000292C12 


ECO101438 


S1M10000032C10 


SAU800156 


E1M10000106F05


ECO103625


E1M10000294F01 


ECO103265 


S1M10000032E10 


SAU801263 


E1M10000106H05


ECO103884


E1M10000294F01 


ECO103266


S1M10000032F10 


SAU801434 


E1M10000106H06


ECO103394


E1M10000294C02 


ECO103226 


S1M10000032F10 


SAU102585 


E1M10000106A08


ECO103266 


E1M10000294E02 


ECO103237 


S1M10000032G10 


SAU800548 


E1M10000106E09


ECO101347 


E1M10000294G02 


ECO103238 


S1M10000032B11 


SAU801396 


E1M10000106E09


ECO101348 


E1M10000294G02 


ECO103239


S1M10000032C11 


SAU800117 


E1M10000106G10


ECO102104


E1M10000294A04 


ECO100095 


S1M10000032C11 


SAU800116 


E1M10000106D11


ECO301060 


E1M10000294C04 


ECO101307


S1M10000032D11 


SAU801626 


E1M10000106D11


ECO302213 


E1M10000294C04 


ECO202164 


S1M10000032D11 


SAU801625 


E1M10000122B03 


ECO102227


E1M10000294F04 


ECO103264


S1M10000032D11 


SAU801624 


E1M10000123D05 


ECO100139 


E1M10000294F04 


ECO103265


S1M10000032E11 


SAU800391 


E1M10000123C09 


ECO103231


E1M10000294H04 


ECO104090


S1M10000032F11


SAU800173 


E1M10000123E09 


ECO101689


E1M10000294D05 


ECO100144 


S1M10000032H11 


SAU802496 


E1M10000123H10 


ECO102255


E1M10000294D05 


ECO100145 


S1M10000032B12 


SAU801193 


E1M10000123F11 


ECO103226


E1M10000294F05 


ECO103243 


S1M10000032C12 


SAU800117 


E1M10000107B02 


ECO100298


E1M10000294F05 


ECO103244


S1M10000032C12


SAU800116 


E1M10000107E02 


ECO102706


E1M10000294H05 


ECO102336 


S1M10000032E12 


SAU800526 


E1M10000107E02 


ECO102705


E1M10000294C06 


ECO103116 


S1M10000032F12 


SAU801644 


E1M10000107G02 


ECO103543


E1M10000294F06 


ECO103878 


S1M10000032G12 


SAU801771 


E1M10000107B03 


ECO103163


E1M10000294F06 


ECO204942 


S1M10000033H01 


SAU800490 


E1M10000107B03


ECO103164


E1M10000294G06 


ECO103224


S1M10000033A02 


SAU801434 


E1M10000107C03 


ECO101468 


E1M10000294G06 


ECO103225 


S1M10000033B02 


SAU802223 


E1M10000107H04 


ECO102227


E1M10000294A08 


ECO103236 


S1M10000033D02 


SAU802613 


E1M10000107G08 


ECO100158 


E1M10000294A08 


ECO103237 


S1M10000033F02 


SAU800489 


E1M10000107F09 


ECO103884


E1M10000294B09 


ECO103227


S1M10000033H02 


SAU800548 


E1M10000107H09 


ECO102065


E1M10000294E09 


ECO100135
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S1M10000044H08 


SAU801181 


E1M10000229D12 


ECO103221 


P1M10000108G06 


PAE201697 


S1M10000044A09 


SAU802154 


E1M10000230C02 


ECO103263


P1M10000109A02 


PAE202364 


S1M10000044D09 


SAU800942 


E1M10000230A03


ECO100500 


P1M10000109C03 


PAE201267 


S1M10000044E09 


SAU801237 


E1M10000230A03 


ECO100501


P1M10000109E03 


PAE205383 


S1M10000044H09 


SAU801355 


E1M10000230F03 


ECO103263 


P1M10000109D04 


PAE201789 


S1M10000044H09 


SAU801354 


E1M10000230B04 


ECO101687 


P1M10000109A05


PAE109011 


S1M10000044D10 


SAU801526 


E1M10000230C04 


ECO103287


P1M10000109B08


PAE202225 


S1M10000044E10 


SAU801342 


E1M10000230E04 


ECO101259 


P1M10000109H09 


PAE202390 


S1M10000044F10 


SAU802309 


E1M10000230F04 


ECO103481 


P1M10000109E10


PAE200217 


S1M10000044F10 


SAU802308 


E1M10000230B05 


ECO102636 


P1M10000109F10 


PAE204909 


S1M10000044G10 


SAU802309 


E1M10000230D05


ECO100838 


P1M10000109E11 


PAE203877 


S1M10000044G10 


SAU802308 


E1M10000230H05 


ECO103236 


P1M10000109B12 


PAE205001 


S1M10000044H10 


SAU801515 


E1M10000230A06 


ECO102555 


S4M10000001C01 


STM102449 


S1M10000044A11 


SAU802247 


E1M10000230A06 


ECO102556 


S4M10000002G04 


STM104276


S1M10000044B11 


SAU801515 


E1M10000230H07 


ECO101684 


S4M10000002B06 


STM100638 


S1M10000044C11 


SAU802238 


E1M10000230H07 


ECO101685 


S4M10000002G08 


STM104276


S1M10000044D11 


SAU801517 


E1M10000230A08 


ECO100448


S4M10000002B09 


STM102789 


S1M10000044E11 


SAU801138 


E1M10000230A10 


ECO103228


S4M10000019H06 


STM101116 


S1M10000044G11 


SAU801184 


E1M10000230F10 


ECO103237 


S4M10000008H10 


STM102089 


S1M10000044H11 


SAU800367 


E1M10000230H10 


ECO103224 


S4M10000008H10 


STM102090 


S1M10000044H11 


SAU800366 


E1M10000230H10 


ECO103225


S4M10000009E03 


STM102089 


S1M10000044A12 


SAU802240 


E1M10000230B11 


ECO103461 


S4M10000009E03 


STM102090


S1M10000044B12 


SAU801526 


E1M10000230G11 


ECO103886


S4M10000009C06 


STM103418 


S1M10000044C12 


SAU802162


E1M10000230C12 


ECO103228


S4M10000009E07 


STM103805 


S1M10000044D12 


SAU800765 


E1M10000231A02 


ECO102637 


S4M10000009G08 


STM103805 


S1M10000044D12 


SAU800766 


E1M10000231C02


ECO103886


S4M10000009B11 


STM104223 


S1M10000045B01


SAU802240 


E1M10000231D02 


ECO101259 


S4M10000009B11 


STM104237 


S1M10000045D01 


SAU801788 


E1M10000231A03 


ECO101227 


S4M10000009F11 


STM103805 


S1M10000045A02 


SAU801630 






WO 02/077183 PCT/US02/09107 



Clone Name


Gene 


Clone Name


Gene 


Clone Name 


Gene
LocusID


E1M10000231F03


ECO 102X34 


S4M l ooooooor? 1 1 

OHxvx X V/WWV/LfVyyVj i I 


^TTlflOO^RO 

O 1 1VHUUJOU 


C 1 \A 1 AAAAA4<R09 
olMl UUUUUH0 JtJUZ 


C ATT0A90<TA 


E1M10000231H04


ECO101 IRS 


S4M 1 00000 1 01? 04 


CTM10341R 

O 1 1V1 1 vJH 10 


• Q 1 A A 1 AA AAA4<PH9 
o 1 M 1 UUUUUH3UUZ 


C AT 7AAACCO 


E1M10000231H04


ECO101186


S4M1 000001 0W04 


otui 03S06 


Q 1 \A\ 0AAA04^R07 
oiM IUUUUUHjoU J 


CAT TQAAC41 


E 1M 10000211 ROS 

X-» XXVX X \J\J\J\jjLiJ xXJUJ 


FP01 03991 


^J4M1 00000 1 OPn^ 


QTM103R09 
O 1 1V1 1UOOUZ 


Q 1 A/f 1 AAAAA/KPAO 
oiJVLl UUUUUH DCU j 


CAT T0A1 OCX 

oAUoUl 


E1M10000231ROS 


FPO103999 


<s4M 1 00000 1 0HH7 


CTM 104993 
O 1 M 1 UHZZJ 


C 1 A*1 AAAAA4^rini 
o IMIUUUUUHjUUj 


CAT TOAAOAO 

oAUiSUUZUiS 


E1M10000231F05


EC01 03931 


^4M 1 00000 1 0H07 


CTM1 04937 
O 1 XVI 1 UHZ J / 


Q 1 A>f 1 AAAAA4<riA9 


CAT IOA1 0/£A 


E1M10000231F05


ECO103239


S4M1 000001 ODOR 

kJ*Tivx L UUuUU 1 UuUO 


<TTM10341R 


A/fl 000004^PO4 
OlMl UUUUUHD V/UH 


CAT TQA91 CO 

oAUoUZl jy 


E1M10000231P06 


FPO103RR1 

XwV»/W IUJ0O X 


<s4M 1 00000 1 0ftOQ 


otu 10341 R 
O 1 M1UJH1 0 


Q 1 A>f 1 AAAAA/l^PAA 
OIMIUUUUUHJV^UH 


CAT TOAO 1 CO 

oAUoUzlDO 


F1 Ml 000023 1P06 

Ij 1 Ivi lUUvUZ. J XvvU 


FPD103RR9 


Q4M 1 000001 OPOQ 


CTNyf 1 0341 R 
oIMIUjHIo 


Q 1 A/f 1 AAAAA/1 CCA/1 

o 1 M 1 UUUUU4 jHU4 


CAT TOAl O AO 

oAUoOlzuo 


F 1 M1 000023 1 A OR 


FPP1 039A9 


c*4A/fi oooooi onno 

o*f 1V1 1 \J\JVUU l UUKJy 


o 1M1UJh1<$ 


CI XX 1 AAAAA/!C17A/1 

o 1 M 1 uUU0u4 or U4 


CAT TOAAOOl 

SAU800771 


F1 Ml 0000231 A 08 

JZ» xiYx 1VJUUUZ J LfWJo 


FPD103R7R 


^4Ayf 1 00000 1 0F\ 1 fi 
MM 1 UUUv/vJ 1UJJ 1 U 


OxMIUjHIo 


C 1 \A1 AAAAA/tCOAC 

o 1 M 1 UUUUU4 jCU j 


CAT TO AT C 1 O 

oAuo0151o 


Fl A/f 10000931 A OR 

.C JLlYi iUUUUZj IrxUO 


FPO90xlQzL9 


Q4\>T1 000001 1 17fK 


olMIUoHlo 


o 1 XXI AAAAA/1 CpnC 

o 1 M 1 U0UUU45 LU j 


CAT TOAl CIO 

SAU801517 


F 1 A/f 1 000093 1P0Q 

E/ iiVl 1UUUUZJ l^Uy 


upn 109997 
.CA_,VJiUZZZ / 


C4\yf i onnAfii 1 rinc 


CT"A>ri A/l 1 11 


O 1 \ >T 1 AAAAA/1 CD AC 

o 1 M 1 UUU0U45 b0 5 


OAT TOAA/lOO 

SAU800478 


F 1 M 1 00009 3 1 frOQ 


FPOl 0396/1 




CTAyf 1 A9AQO 


C? 1 XXI AAAAA/1 CT7AC 


CAT TOA 1 A 1 1 

SAU801011 


F1M1 000093 1P0Q 

C ilYllUUUUZJ 1VJU7 


XSWJiUoZOO 


; q4\>ti nnnnni 1 a no 


CT'Xvf 1 AO AAA 

oiMiuzuyu 


S1M10000045A06


SAU802247


F1M1 0000931 PI 0 


FPP1 09994 
iUZZ /H 


Q4\/r 1 nnnnni 1 uno 


CTA>ri AACQA 

ol IYLIuUjoU 


O 1 X X1 AAAAA/1 C(~*f\tL 

olM100U0045O06 


SAU801275


F1M10000931F10 


FPO1040Q3 


Q4K^i nonnni i tji a 

oHlvll UUUUU 1 lJclU 


oIMIUjoUD 


S1M10000045H06


SAU800543


F 1 Ml 000093 ID 1 1 


FPDi039a"3 

CLU I \J JZO J 


Q4JV^1000001 JT7TD 


CTTN/T1 A70A< 

o IMlUJoUj 


S1M10000045A07


SAU800566


E1M10000231F1 1 

J— < 1 IVX 1 \J\J\J\J£,J X X-i 1 1 


FPO 1 03964 


Q4\>f 1000001 1TM 1 
OHiYll UUUUU 1 ID i 1 


CTAif 1 AO/1 1 O 


S1M10000045B07


SAU802228


E1M10000231G11


FP0 100040 


C4M 1 000001 9M0^ 


o IMIUjoUD 


S1M10000045C07


SAU801515


E1M10000231C12 


FPO10399R 


Q4M1 000001 9T506 


olMIUjyjo 


S1M10000045D07


SAU800478


E1M10000231E12


FrO103933 


Q4M1 000001 9T1 1 9 
OHivi 1 UUUUU 1 ZD 1 Z 


CT*A/T1 AOQAO 

oIMIUjoUZ 


S1M10000045G07


SAU800499


E1M10000231E12


FrO 103934 


000001 7TVi9 
OHiYl 1 UUUUU I JxJXJZ 




S1M10000045G07


SAU202691


E1M10000231G12 


FPO10146R 


Q4M1 00O0O1 ^UH9 
o*tIYl 1 UUUUU 1 JrlUZ 


co'X/ri ai ooo 
olMlUlZ/5 


S1M10000045A08


SAU802610 


E1M10000214E01


Fern oi 7ro 


c^y i nOHOOl 4TJH9 

o^tIyi i uuuuu 1 *friuz 


olMlUiyuo 


S1M10000045D08


SAU801193


E1M10000214F01


FPD T 04 1 R 1 


oHM 1 UUUUU 14dU j 


STM102089


S1M10000045E08


SAU801619


E1M10000214R02 


FPO1001 60 


L>*f M 1 UUUUU IHD U J 


STM102090


S1M10000045F08


SAU801185 


E1M10000214E02


FPOl 041 Rl 

Xw WW X V*T lO 1 


oH ivx I UUUUU 1 4JJU / 


cnrXif 1 ao ao a 


S1M10000045G08


SAU600582 


E1M10000214H02


FPOI 03937 


Q4\/i i nnnnn 1 41^^*7 

oH-lVl l UUUUU 1 huu / 


CO^X X 1 AO AAA 

oiMiuzuyo 


S1M10000045G08


SAU103752


E1M10000214H02 


FPO10393R 


oHxYi 1 uuuuu I Dxiuy 


STM102089


S1M10000045C09 


SAU801144 


E1M10000214E03




otlYJ. 1 UUUUU I Dtyjy 


ol MiuzUyu 


S1M10000045D09


SAU801516 


E1M10000214E03


FPO101 S9S 

IwW X U I Ji-J 


^4M1 000001 ^U1 1 
of Jvl 1 UUUUU 1 jo 1 1 


olMlU2Uo9 


S1M10000045D09


SAU201288 


E1M10000214C04 


FPO109R9R 


Q4"\yf 1 nnn AA 1 ST* 1 1 
041V1 1 UUUUU 1 jo 1 1 


STM102090


S1M10000045E09 


SAU802237 


E1M10000214G06


FPH1 0366Q 


C/lA/f 1 AAAAA1 X A AO 

OHM 1 UUUUU 1 OAUZ 


STM102089


S1M10000045E09 


SAU802236 


E1M10000214A07


FPOl 03SSQ 


C4A/T1 AAAAAK A m 
OHM 1 UUUUU 1 OAUz 


STM102090


S1M10000045B10 


SAU801511 


E1M10000214D08


FPOl 0O9 57 


QA\A 1 AAAAAOA17AO 

j4M 1 UUUC/UzUr Uo 


STM102835


S1M10000045D10


SAU801630 


E1M10000214D08


FPOl 01 R69 


^4^^! AAA A AO 1 T7A"7 

oHM I UUUUUZ 1 C.U / 


oTM102835 


S1M10000045E10 


SAU800545 


E1M10000214D0R 


FPO1041 R3 


Q4A/T1 AAAAAOODAO 
oHM lUUUUUZZoUz 


STM100693 


S1M10000045G10 


SAU800006 


E 1 M 1 00002 1 4D0R 

x-» x xvx XVWwZ> XtX-^UO 


FPni0^76^ 


QA\A 1 AAAAAOOPkA/i 
oHM 1 UUUUUZzUOh 


STM103802


S1M10000045H10 


SAU800517 


E1M10000214H11

A-» X JVA X UU \J\J£i I Til 1 I 


FPOl 0400O 


QA\A1 AAAAAOOPbA;! 

oHM I UUUUU2ZD04 


STM103815 


S1M10000045A11 


SAU802488 


E1M10000214F19 

Xw 4 IVX X vUV/Ui< X "X 1 £■ 


FPOl 04009 


Q4K/1 1 AAAAAOOD AC 
OHM 1 UUUUUZZoU J 


STM103235


S1M10000045B11 


SAU801518 


E1M10000214F12 

Arf .1 ATA X V w WX# X TX 1 X* 


FPO1040Q3 


Q4A/T 1 AAAAAOO/^AO 

oHM lUUUUUZZuU / 


STM103802


S1M10000045B11 


SAU801517 


E1M10000215B01 


FPOl 0301 1 


Q4A/1 1 AA A A AO o r* A n 
OHM 1UUUUUZZVJU / 


OTK A 1 A1 oi r 

STM103815


S1M10000045D11 


SAU800479 


E1M10000215F01


ECO103237 


^4A4 1 OOAAAOOn t 0 
OHIvi i \J\J\j\J\JZ£U 1 Z 


OTA A 1 A 1 A 1 n 

o 1 M1U2419 


S1M10000045D11 


SAU800480 


E1M10000215F01 


FPO10393R 


^4\/f 1 OAAAAOTrM o 
OHM 1 UUUUUZZU 1 Z 


OT'X/M AO /I To 

STM102422


S1M10000045E11 


SAU800984 


E1M10000215B03 


ECOl 04090 


C^y 1 A0OOO99T3 1 9 
OHxVl 1 UUUUUZZXl I Z 


CO?\yri AOAon 

o I M 102089 


S1M10000045F11 


SAU801193 


E1M10000215F03 


ECOl 02294 


C^yi 00OAA99T71 0 
OHxVl 1 UUUUUZZc 1 z 


onr\ x i ao aa a 
STM102090


S1M10000045H11 


SAU800517 


E1M10000215F03


ECO102295 


S4M10000024G01 


STM103802


S1M10000045A12 


SAU801593


E1M10000215H03 


ECO103263 


S4M10000024G04 


STM103274 


S1M10000045B12 


SAU801518 


E1M10000215B04 


ECO102636 


S4M10000024C06 


STM100227 


S1M10000045B12 


SAU801517 


E1M10000215C05 


ECO103242 


S4M10000024C06 


STM100229


S1M10000045C12 


SAU801801 


E1M10000215D05 


ECO101669 


S4M10000024F08


STM100274 


S1M10000045D12 


SAU802231 


E1M10000215F06


ECO104092 


S4M10000024G09 


STM103802 


S1M10000045D12


SAU802230 






Clone Name 


Gene 
LocusID 


Clone Name 


Gene 

LocusID


Clone Name 


Gene 
LocusID 


E1M10000215F06 


ECO104U7J 




STM104123


S1M10000045E12 


SAU802249 


E1M10000215A07 


T7/~*/~\ 1 A>1 AQ1 




STM102366 


" S1M10000045F12 


SAU802225 


E1M10000215A07 


ECO104093




STM10^R02 


S1M10000045G12 


SAU801275 


E1M10000215B07


ECO103331 


c/i\>ri AAAAn / )^un7 


CTM 10^274 

O 1 lVAl VJZ. /*+ 


S1M10000046B01 


SAU802612 


E1M10000215C07 


ECO103243


Q/1AA1 AAAAHO^ A 1 1 


O 1 1V1 1 UHZ>^ J 


S1M10000046D01 


SAU800547 


E1M10000215H07 


ECO103244


C A \ H 1 AAA AAT ^T? \ 0 


O 1 1V1 1 vJ 0 VJ J 


S1M10000046E01 


SAU801733 


E1M10000215F08 


ECO103220 


O A \ K 1 AA AA AT ^ C1 9 




S1M10000046F01 


SAU800753 


E1M10000215A09 


ECO103725


QA\AA AAAAAlAPOl 


QTM1 00693 


S1M10000046G01 


SAU800111 


E1M10000215G09 


ECO101468


C/llVif 1 AAnAnOATtO^ 
o4M I UUUuUZOCU J 


O 1 IVi 1 \J\J£ 1 *t 


S1M10000046H01


SAU802112 


E1M10000215G09 


ECO101467




ctu 100637 


S1M10000046H01 


SAU802111 


E1M10000215C10 


ECO100757




^TM 102089 


S1M10000046C02 


SAU800530 


E1M10000215D10 


ECO103229 


C/lTVJTl AAAAmfi"R1 0 




S1M10000046D02 


SAU800219 


E1M10000215B12 


ECO101139 


C/l^>^1 AAAAfiOAT? 1 9 


090Q0 


S1M10000046E02 


SAU801572 


E1M10000215F12 


ECO103244


O/iX/f 1 AnAflAO'7'CnO 




S1M10000046F02 


SAU802250 


E1M10000216E01 


ECO103232 


o /IX A i AAAfimTP 1 A 

S4M10000027C10




°. 1 N/T 1 0000046F02 


SAU802251 


E1M10000216E01 


ECO103233 


C >1 \ X 1 AAAAAO'7^* , 1 A 




S1M10000046G02


SAU801517 


E1M10000216B02 


ECO103886 


C yl X K 1 AA AA AOQT3 1 T 




S1M10000046A03


SAU801843 


E1M10000216C02 


ECO103886 


O >*X /T1 AAAAAIftD 1 1 

S4M10000029B12 


QT\A1 01000 


ciui 0000046B03 


SAU800966 


E1M10000216G02


ECO104048


O /! X jT1 AAAAAIOTM 1 




S1M10000046D03


SAU801572 


E1M10000216H02 


ECO103243


C* A~K At AAAAA1 ACAA 

c>4MlUU0UU3UrUo 


QTN/f 1 0701 1 


9 1 M 1 0000046G03 


SAU802120


E1M10000216H02 


j"*-t ✓"S -t A A A /I y4 

ECO103244


O A \ AA AA AAA'S A17A7 


CfX yf 1 A I 1 1 f. 

o 1 IVLIU L L 10 


<?1M10000046A04 

O 1 1Y1 L VA/UUv/ "vAV~ 


SAU800738 


E1M10000216B03 


ECO100886


CMX/I AAAAA1TDA1 

o4M 1 UUUUU3 ZrU 1 




<s1 Ml 0000046 A04 


SAU800737 


E1M10000216D03 


ECO103262 


O /IX X t AAAAATOAA1 


o 1M1U4/.ZJ 


<s 1 M 1 0000046B04 


SAU802235 


E1M10000216D03


ECO103878 


CrfX/M AAAAA'l^Cm 

S4M 1 OUUOUizr U3 


olMlUiyOD 


<3 1 Xyfl 0000046R04 


SAU802234 

kJAW Ovfatn/7 


E1M10000216D03 


ECO204942 


O A X A 1 AAAAA1 yl A AO 

MM 1 UUUUU34AUZ 


0 IMiUZjOO 


Qixvii noooo46ro4 


SAU801234 


E1M10000216C04 


ECO100468


CMX/M AAAAA1/1/^A^ 

o4M 1 UUUUU j4Uv J 




°» 1 M 1 0000046D04 


SAU800001 


E1M10000216F04 


ECO101257 


C /I X A 1 AAA A AO /f PAC 

S4M10000034C05


CTA/f 1 A7AQA 

o iiviiuzuyu 


qiui no0004^F04. 


SAU802231 


E1M10000216E05


T-»/— 4 AAZT C A 

ECO100659 


o4M 1 UUUUU34rlU j 


o i rviiuuoj / 


qmi 0000046F04 


SAU802230 


E1M10000216H05 


ECO103237


C/lXvT 1 AAAHA1A A A£ 
o4M 1 UUUUU^4 AUO 


QTM10T?A7 
o lLvLlyJJLH / 


^ 1 M 1 0000046G04 


SAU801375 


E1M10000216H05


T"»y~« /~\ 1 A'l A 1 O 

ECO103238


C/IX/1 AftAnni/l A AO 

o4M 1 UUUUU34 AUy 


CT\/fl AlRfi7 
O 1 lYLI\JJOV/Z 


9 1 1U1 0000046R05 


SAU801253 


E1M10000216E07 


ECO100456


C /! X /T 1 AAAfl AQ A A AO 

o4M 1 UUUUU34AUy 


CT]V>T1 0181 S 
olMl U J ol j 


ciyi 00OO046P0S 


SAU801286 


E1M10000216A09


ECO103461


0/1X/M AAAAAI/ITJAQ 

o4M 1 UUUUU J4ril>y 


QT\/f 1 01 11S 


c 1 x/fi 0000046DO S 


SAU802247


1 U jT1 AAA Art 1 1 /\ 

E1M10000216B10 


ECO101780


C/tX/T1 AAAAA1/1IJAQ 

o4M 1 UUUUU34rlUy 


QTK/f 1 0 1 1 1 (\ 
oJlVllUi 110 


ciui OOOO046F0S 


SAU802056 


E1M10000216C11 


ECO103885 


C/iX>T1 AAAAA1 ^13 A 1 

o4M 1 0OUUU3 J DV 1 


O 1MIU/4U1 


ciyi 0000046 A Ofi 


SAU800514 


E1M10000216E11 


ECO103885


O >*X ift AAAAAO CT*\A1 

S4M10000035D01 




q 1 \a i onnnoAnPOf^ 

o iMIUUUUUHOoUO 


c ATTa0143^ 


E1M10000216H11 


ECO103001


O >l X X 1 AAAAAO CC7AA 

S4M10000035F02


OTX/f 1 A1 1 K 

olMlUI 1 10 


Q 1 \A~ 1 0000046F06 




E1M10000216D12 


ECO103483


O » X rl AAAAAO CT?AO 

S4M10000035E03


biMiuiUby 


o 1 JVl 1 UUUUU40 t>U / 


CATIR01 630 


E1M10000217D02 


A/\ 1 A A ^ O 

ECO103536 


C /IX /T1 AAAAAO CCA1 

S4M10000035E03 


b iMiuzuyu 


q i \>f 1 onnno4^P07 


<3ATTR02247 


E1M10000217E02


ECO100555 


S4M 1 0000035bUo 


QTKA 1 AACAfi 

MMlUUoOO 


Q1 \Ai 0000046P07 
0 1 JVl 1 UUUUWUL'U / 


cat TR02248 


E1M10000217H02 


ECO101684


p ^ X / i AAAAAO C A AA 

S4M10000035A09


olMlUU 12.5 


o 1 JVL 1 UUUUU40ll\J / 


CAT TR0147S 


i x r i aaaa*"* i A a 

E1M10000217C04


ECO103756 


C/X/1 AAAAA1C A AO 


CTX.A1 AA77A 


R \ M 1 000004 aG07 

1 IVI 1 UUV/UU*tOVJ v / 


C >AXJ801261 


E1M10000217D06 


ECO103226


C A X /T 1 AAAAAO <T7fi/C 

o4M 1 UOUOUiOr 00 


CTX/f 1 AA7AO 

o 1 M1UU //j 


<s1K/f1 0000046 A OR 
o I Ivl lUUUUv/*tO/\Uo 


9AT 1801095 


T"> A It Jf i AAAA1 1 TT»A1 

E1M10000217B07 


ECO103161 


C/iX/T1 AAAAAO/CT7A/^ 


O 1 M 1 UU / Z4 


Q11U1 00O0046ROR 


CAT J801618 


E1M10000217B08


ECO103262


C A X /T 1 AAAAAO/CD AO 


QTlVyf 1 A1 1 1 A 
olMlUl 1 10 


Q1M1 O0OO046P0R 


CATIS00517 


E1M10000217B08 


ECO103878


O /t X if 1 AAAAAO CX3 1 1 

S4M 1000003 on 1 1 


C^/f 1 A/1771 


Q 1 \yf 1 00O0046n0Q 
olMlUUUUU40JJUo 


CAT TR01 1 11 


E1M10000217B08 


PAAAAjI A /I A 

ECO204942 


C A X A 1 AAAAAO /XU 1 1 

j o4M 1 UUUUUJori 1 1 


0 1 M11/4ZJ / 


<n1 \A 1 00O0046FOR 
0 1 1YI I UUUUU*+01_»wO 


'sATlRO^dl 


E1M10000217G10


rr/~«/^\ 1 aoo 1 A 

bcuiuj/iy 


C/1X/ri AOAOAT7 A (\"X 


CTUI A0S41 
o 1 IVI luUJo 1 


ciui 000004fiFOR 

ij 1 IVI 1 \J\J V/V/V/*tUi uo 


SAU802711 

kj A w OUfa 1 x x 


E1M10000217B11 


ECO102619


S4M10000037A08 


STM102401


S1M10000046A09


SAU801109 


E1M10000217C11 


ECO103886 


S4M10000037H09 


STM103180 


S1M10000046B09 


SAU801630 


E1M10000217E11 


ECO101763 


S4M10000037E10 


STM100866 


S1M10000046D09


SAU801353 


E1M10000217G11 


ECO102744


S4M10000037F10 


STM102366 


S1M10000046F09 


SAU800456 


E1M10000218D01


ECO102828 


S4M10000033G05 


STM100137 


S1M10000046G09 


SAU802049 


E1M10000218F01 ECO103236 


S4M10000033F08 


STM102419 


S1M10000046D10 


SAU802223 






Clone Name 


Gene 
LocusID 


Clone Name 


Gene 
LocusID 


Clone Name


Gene
LocusID


E1M10000218F01 


ECO103237


S4M10000033F08


OTTV A 1 AO/I OO 

STM102422


Q 1 \A 1 ftfifinfiiln'F 1 0 
0 1 iVl I Uv/UUU'tU Ct L v 


SAU801241 


E1M10000218A02 


ECO103624


S4M1 000003 3U09 


MM1024iy 


d 1 m 1 nnnnfu^ f i o 

O 1 IVi 1 \J\J\J\J VnOr 1 \J 


SAU801760 


E1M10000218A02


ECO103625


S4M 1 000003 iooy 


a 1M1024ZZ 


^ 1 m 1 nnnnfiiifiG 1 0 

O 1 1YI 1 VA/UUVJ*rUVJ 1 \J 


SAU800545 


E1M10000218D02 


ECO103953


S4M 1000003 3d ID 




O I IVl 1 UUUUUHUll. 1 \J 


SAU800543 


E1M10000218G03 


ECO101686


nii/i AAAAAA1 f"lA1 

SlMlOOOOOOiDUl 


cat Tftno^/ift 


m M 1 0000046 A 1 1 


SAU800539 


E1M10000218E07 


ECO104092


SIMlOOOOOOlrOl 


cat TQAmoo 


ci Ml 0000046 A 1 1 

O 1 iVl l V/U WU*rWA 1 1 


SAU800540 


E1M10000218E07


ECO104093


p 1 \ / i AAA A AA 1 T\AO 

S1M100000U1DUZ 


Q AT TOA1 1 CO 


91M10000046B11 


SAU802557 


E1M10000218B08


ECO100663


n i x / i AAAAAA1 P\AO 

S 1 M 1 00000U I uuz 


o at TR01 1 <J1 
oAUoU 1131 


9 1M10000046C11 

O 1 IVl 1 UV/vUvtUv l l 


SAU800219 


E1M10000218H08


ECO100094


o 1 X if 1 AAAAA A 1 E A1 

S1M1 000000 lbU2 


CAT TQ07*)A7 
oAUoUZZ4 / 


Q 1 Ml 0000046D1 1 


SAU800287 


E1M10000218H08


ECO100095


a * ■» K 1 A A A AAA 1 L'AI 

SlM10000001r02 


CAT TGAA</lft 


ciMinnnnfidfini i 

0 llYllvUUUutOA-' 1 1 


SAU200106 


E1M10000218B09 


ECO103881


S1M10000001A04


CAT TQAAOAO 

oAUoUUZUy 




SAU802218


E1M10000218B09


ECO103882


4 AT 4 AAA f\F\r\. \ T**f\ A 

S1M10000001E04 


OAT TOA1 1 £L A 
SAUoOzlOO 


CI Ml (\TlC\r\ClAf\A 1 9 
jIMI UUUUU40 A 1 Z 


cat 1901799 


E1M10000218C10 


ECO102788


r* 4 » r+ AAAAAA1 T"?A A 

S1M10000001E04 


OAT T/CA1/1AA 

SAUo034oU 


c 1 iv>f 1 nfifinnAfiR 1 9 


SAU801275


E1M10000218A11 


ECO100095


A 1 H Jfl AAAAAA1 Cft /( 

S1M10000001F04


C AT TQAO<*17 


c i mi nnnno4fiPi ? 


SAU802601 


E1M10000218B11 


ECO103319


oi i /1 AAA A A A 1 A AC 

S1M10000001A05


CAT TQAT3QC 


ci mi nnnnotdfin 1 9 


SAU800287 


E1M10000218E11


ECO103221


null AAAAAA1 A AC 

S1M10000001A05


CAT IftA7*i0Q 


ciyi aa fk004fif) 1 9 

O 1 IVl 1 \AJ\J\J UHU JL/ i Z> 


SAU200106 

UAw wwV AW 


E1M10000218E11


ECO103222 


o i * A 1 AAAAAA1 T\f\£ 

S 1 M 1 OOOOOOlDOo 


C A T TQAfi/id 


Q 1 M 1 0000046F 1 9 


SAU801618 


E1M10000218E11 


ECO103223 


mi f i AAAAAA1 T\A*7 

S1M10000001D07 


C ATTftAl £91 

0AU0UIOZ i 


O llVl l UUUUUt ' * 


SAU802090 


E1M10000218B12 


ECO103262


c\ * X jTI AAAAAA1 A AO 

S1M10000001A08 


CAT TQAAAAC 


C 1 M 1 0OOO0 A7P0 1 


SAT 1801 256 


E1M10000218B12 


ECO103878


ni x / i AAAAAAi T7AO 

S1M10000001F08 


CAT TQAAAAC. 

oAUoUUUUj 


Q1 A/f 1 0000047F01 


c AT J800547 


E1M10000218B12 


ECO204942 


A> * % jT 1 AAA AAA 1 A A A 

S1M10000001A09 


OAT TOAACit O 

SAU800548 


c 1 iv a i {\f\f\c\f\Ainr\ i 
olMlUUUUU4/UUl 


CAT TaOOI SI 


E1M10000218C12 


ECO103265


S1M10000001E09 


A A TTOA1 /I A 

SAU801619 


01XI1 AAAAA AH/tri^ 

S 1 Ml 000004 / uU 1 


CAT TSIO01 S9 


E1M10000218E12 


ECO100180 


S1M10000001F09 


SAU800548 


Oi x /i AAAAA/ITD AO 

S 1 M 1 00U004 /oUZ 


i CATT8099ilO 


E1M10000218G12 


ECO100886 


S1M10000001A10


ri a T tfl A A A Jl ^ 

SAU802247 


O 1 X if 1 AA AAA/1 7PAO 

S 1 M 1 0000047C02 


C AT TCA1 7^ 


E1M10000219C01 


ECO103228 


S1M10000001F10 


SAU802247 


ni \ /l AAAAAinAAl 

S 1 M 1 000004 / JJU2 


CAT TC07107 


E1M10000219C01 


ECO103229


>— i -f *X MA A A A A A A A i A 

S1M10000001G10 


OAT lOAAfT) 

SAU800537 


C 1 X A 1 AA AAA/1 TEA.*) 

i>lM100UU04/cUZ 


C AT T ft 00047 
oAU 0UU74Z 


E1M10000219B04 


ECO102814 


j». t jy m AAAAAAi T"^ 1 1 

SIMlOOOOOOlEll 


A TTOA1 >I01 

SAU801481 


oiX/fi AAAAA/lTUm 

b 1 M 1 00 0004 / r U2 


CAT 7ROOS47 


E1M10000219E05 


ECO103263


s-m + *T 1 a a A A A AAT^ A 1 

S1M10000002B01 


SAU802247 


OI XX 1 AAAAA/IT^iA*) 

olM100U004 /KjVZ 


CAT TR077/U. 


E1M10000219F05 


ECO101711 


/»■ 4 a a A A A A A /\ t 

S1M10000002D01 


A a T TO AAC A O 

SAU800548 


pi x/i AAAAA/1*7 A A'J 

5> 1 Ml 000004 / AU J 


CAT TQOOS/lf^ 


E1M10000219H05 


ECO104286 


A -f X X 1 AAAAAAAf'AI 

S1M10000002E01 


OAT IOAA/CAA 

SAU8OO0OO 


C 1 \A 1 AAAAA/l'7/^n'* 


CAT TR0 1 1 Rfi 
oAU ovl loO 


E1M10000219B06 


ECO103262 


A 4 » M 4 A A AA A A*\T"»A 1 

S1M10000002F01 


OAT TO A A C /II 

SAU800543 


O 1 \>T 1 AA AAA/1 THAI 

v>lM 1000004 /UU3 


CAT TS01 7A^ 
bAUoUlZOJ 


E1M10000219B06 


ECO103878 


n 4 \ ST i A A A A A A A A A a 

S1M10000002A02 


OAT TOAA£ff 

SAU802655 


O 1 X A 1 AAA AA/t TCA1 

b 1 M 1 000004 / JiU3 


CAT TCOOOI ft 


E1M10000219B06 


ECO204942 


S1M10000002C02 


r*l A ▼ TAAA A C\ f 

SAU802496 


O 1 X M 1 AAAAA/1 TCA1 

S 1 M 10000047r 03 


CAT TOA1 *71 O 


E1M10000219C06 


ECO103100


S1M10000002D02 


A A T T O A A O O 1 

SAU800381 


O 1 X jTI AAAAA/nPAO 

SI Ml 0000047003 


CAT TOA*) AOA 


E1M10000219G07 


ECO103243


nil /1 AA A A A A^FAI 

S1M10000002E02 


OAT TOAACA^ 

SAU802506 


O 1 X A 1 A A A A A /I *7T_T A 1 

S 1 M 1 00000471103 


CAT TOAOAA^ 


E1M10000219H07 


ECO103884


n 4 % -4 A A A A A A AT^ A A 

S1M10000002F02 


SAU802502 


O 1 X A 1 AA AAA/1*7 A AvI 

S1M10000047A04


CAT TOAA^I Q 

oAU juuoiy 


E1M10000219A08 


ECO100886


Ai * 1 aT 4 A A A A A A A A A 

S1M10000002B03 


O 1 T 70A1 T1 A 

SAU801712 


O 1 X At AAAAA/1 TO A/1 

S 1 M 1 0000047B04 


CAT TQAA1 /1ft 
oAUoUU14o 


E1M10000219A09 


ECO103161 


g»+ *\ m x i A A A A A A A Y~\. A A 

S1M10000002D03 


O A TTOAA/Trf 

SAU802655 


O 1 X if 1 AAAAA /I If 1 A>1 

S 1 M 1 0000047C04 


CAT TOA1 1 1Q 

0AU0OI 13^ 


E1M10000219E09 


ECO100193 


4 ■v r 4 A A A A A A A A A 

S1M10000002G03 


A ATIOAIIjIA 

SAU801740 


S 1 M 1 0000047D04 


CAT T0AAC/1/C 

bAU600!>4o 


E1M10000219E09 


ECO100194


S1M10000002B04 


a a t rnAAi* in 

SAU800548 


S 1 M 1 0000047E04 


OAT TOAAOO 

SAUb0052o 


E1M10000219A10 


ECO103883


A^ -4 % MA A A A A AAA I""* A r 

S1M10000002B05 


A T T O A 1 A "> 

SAU801263 


pi i f i a.AAAA/I*7T?A/I 

S1M10000047F04 


OAT 71 AA<C1 O 

L>AU300oiy 


E1M10000219A10 


ECO103884 


A « > i 1 AAAAAAA T~X A /" 

S1M10000002D05 


A ATTOAAA1A 

SAU800210 


n i x / i AAAAA^nAAil 

S1M10000047G04 


OAT TOA1 *7A1 

bAUoOl /OI 


E1M10000219E10 


ECO104093


S1M10000002G05 


A a T TAAA/^ >4A 

SAU800548 


n < I jr 1 AAAAAyl *7T T f\ A 

S1M10000047H04 


OAT TOA1 C OaC 

SAU80250O 


E1M10000219D11 


ECO101324 


n i \ »•« AAAAAAA Y"> A /* 

S1M10000002B06 


A A T TO AAC A /" 

SAU800546 


p 1 l /1 AAAAAilirTA/l 

S1M10000047H04 


OAT TO AO COC 


E1M10000220B01 


ECO103882


■* y i A A A A A A A A 

S1M10000002G06 


Ai A TTAAAa^ a a 

SAU800548 


<"1 t H # 1 AAAAA A T A A AT 

S1M10000047A05


OAT TOAIIH/C 


E1M10000220C01 


ECO103882


^« 1 \ 11 AAAAA A AT^* A ^ 

S1M10000002B07 


O 1 1 TO A A -1 A f 

SAU802105 


O 1 Ik #■» AAAAA VI in f\C 

S1M10000047B05 


O ATTOA1 1 OO 

SAUoOl loJ 


E1M10000220D01 


ECO100886


n 1 X / I AAAAAAA A *"T 

S1M10000002B07 


OAT TOAA 1 f\£ 

SAU802106 


O 1 X At AAAAA/ HAAf 

S1M10000047O05 


CAT TQAA"3^*7 


i7i "N/f i aaaaooatj ai 
biM1000UZ20rUl 




c i m 1 oooooo9no7 
o I lvi 1 uuuuuuzuu / 




C 1 Ml 00000471^0*, 
ol IVl 1UUUUU4 tUVD 


CAT 1801 119 


E1M10000220F02 


ECO101196 


S1M10000002D07 


SAU801899 


S1M10000047E05 


SAU802217 


E1M10000220A03 


ECO104091


S1M10000002E07 


SAU802496 


S1M10000047F05 


SAU801139 


E1M10000220A03 


ECO104092


S1M10000002F07 


SAU800283 


S1M10000047G05 


SAU801670 


E1M10000220B03


ECO101324


S1M10000002D08 


SAU801900 


S1M10000047G05 


SAU801669 


E1M10000220F04 


ECO102555 


S1M10000002G08


SAU800547 


S1M10000047H05 


SAU800018 






Clone Name 


Gene 
LocusID


Clone Name 


Gene 
LocusID 


Clone Name 


Gene 
LocusID 


E1M10000220F04 


ECO102556 


S1M10000002A09
S1M10000002B09


SAU801113 
SAU802496 


S1M10000047A06 
S1M10000047B06 


SAU801434 
SAU801186 


E1M10000220G04 

P 1 X#1 AfkAAOO AHA** 


ECO101684 
FP0 103242 


S1M10000002C09 


SAU800542 


S1M10000047C06 


SAU802217 


E1M10000220E05 
E1M10000220H05 
E1M10000220B06 
E1M10000220D06 
ElMlOOOOzzUDUO 


ECO101257 
ECO100171 
ECO101298
ECO103264 

PP0 103965 


S1M10000002E09 
S1M10000002F09 
S1M10000002A10 
S1M10000002C10
S1M10000002G10 


SAU800547 
SAU801481 
SAU802496 
SAU800385 
SAU801113 


S1M10000047E06 
S1M10000047F06 
S1M10000047G06 
S1M10000047A07 
S1M10000047C07 


SAU802224 
SAU800543 
SAU802244 
SAU802224 
SAU802251 


ElMlOUuUzZUrUO 

r<i \ / 1 ArtAAIOfV A AO. 

E1M10000220AU5 
ElM10000220LA)o 

T7 1 * K 1 AAAAIO A A AO 

E 1 M 1 0000220 AUy 

t-j ii ii AAAA11 ATN AO 

EiM10000220iJUy 


ppn 103451 
ppni 0^963 

cpA 100777 
FPPI1 03884 
PPO10^99S 


S1M10000002B11 
S1M10000002C11 
S1M10000002E11 
S1M10000002A12 
S1M10000002C12 


SAU801475 
SAU800444 
SAU802655 
SAU802502 
SAU800966 


S1M10000047D07 
S1M10000047F07 
S1M10000047G07
S1M10000047H07 
S1M10000047A08 


SAU800367 
SAU802247 
SAU802233 
SAU801186 
SAU802247 


T? 1 X jM AAAAIO AT\AQ 

ElM10000220JJUy 


prni 0^996 


S1M10000002D12 


SAU800967 


S1M10000047B08 


SAU802223 


t> i x k 1 AAAAi o nnno 
E 1 M 1 000022UUUy 


prni040Q0 


S1M10000002E12 


SAU801264 


S1M10000047C08


SAU802223 


T7 1 X At AAAAT1A All 

E1M100UU220A1. 1 


cpa 109744 


S1M10000002F12 


SAU800548 


S1M10000047E08 


SAU802586 


ci xx 1 aaaaooapti 1 
DiMxwUUZZUrll I 




S1M10000002G12 


SAU800548 


S1M10000047E08 


SAU802585 


pixji AAAA001 U01 
blMxUUUUZZxDUx 


FCO101 197 


S1M10000003A01


SAU802496 


S1M10000047F08 


SAU801719 


P1 M1 AOA0991F01 


FPO 103226 


S1M10000003E01 


SAU800700 


S1M10000047G08 


SAU800753 


pi mi aooo99 ifoi 


FCO 103227 


S1M10000003A02 


SAU800607 


S1M10000047H08 


SAU802233 


p i m i 000099 1 R09 


FCO103228 


S1M10000003F02


SAU800548 


S1M10000047A09 


SAU801139 


p i mi 000099 i 7309 


FCO 103229 


S1M10000003A03


SAU800542 


S1M10000047B09 


SAU800942 


p i m i 000099 1 F09 
xi l ivi i uuuuzz i r 


FCO102555 


S1M10000003G03 


SAU800548 


S1M10000047C09 


SAU801139 


P 1 \ A 1 aaaoo 9 1 PA? 

ii l JYl l u uuuz z l r uz 


FPO 1025 56 


S1M10000003A04 


SAU801621 


S1M10000047D09 


SAU802082 


P1 A>T1 AAA AO PA? 

ii I M I UUUUZZZiJUZ 


FPO101 505 


S1M10000003E04 


SAU802247 


S1M10000047E09 


SAU801060 


tjiiii HAAAOOO'nAO 

b 1 M 1 UUUUzzzJJUZ 


PPO104176 


S1M10000003G04 


SAU802496 


S1M10000047E09 


SAU801061 


P 1 \A 1 ftAAAOOOFifiO 

li 1 M 1 uuuuzzzuuz 


PPO 1041 77 


S1M10000003A05


SAU800700 


S1M10000047F09 


SAU800546 


T71 X/f 1 fiAAAOOOPO^ 

tLlM lUUUUZZZllU 3 


FPmOl R40 


S1M10000003F05 


SAU802309 


S1M10000047G09 


SAU801060 


P 1 1\/T 1 AAAA999P0^ 
ClIVLxUUUUZZZllUJ 


FPO101 P*50 


S1M10000003A06

U X ATX 1 WWW*/* fcvx/ 


SAU801904 


S1M10000047G09 


SAU801061 


pi mioooo9?9F05 
c» iiviiuvuuzzzruj 


FPO101475 


S1M10000003B06 


SAU800519 


S1M10000047H09 


SAU800367 


P 1 Ml OOO0999F05 


FCO101476 


S1M10000003C06 


SAU800014 


S1M10000047A10 


SAU801301 


F1M1OOOO999F05 


ECO201962 


S1M10000003D06 


SAU800528 


S1M10000047B10 


SAU801253 


F1M1 0OOO9991309 


ECO102174


S1M10000003F06 


SAU800547 


S1M10000047D10 


SAU800006 


P1 Ml 0OOO999R10 

C 1 1V1 i UWUUZZZO 1 u 


FPO101174 


S1M10000003A07 


SAU800548 


S1M10000047E10 


SAU800543 


P 1 M 1 0000999P 1 9 
EliYll UUUUZZZO 1 Z 


FPO 103481 


S1M10000003C07


SAU801139 


S1M10000047F10 


SAU800006 


P 1 M 1 0OOn99?F 1 ? 


FPO 100652 


S1M10000003E07 


SAU801644 


S1M10000047G10 


SAU800256 


P 1 M 1 000O99^P0zl 
Ji 1JV1 1UUUUZZ.3L/U*t 


FPO103963 


S1M10000003F07 


SAU802506 


S1M10000047H10 


SAU801184 


P i Tvyri 000099 ^P04 

C llVl 1 UUUUZZ jrUH 


I FPO 100095 


S1M10000003B08 

fcj X LiX X WWw\#«^ J—' w W 


SAU800967 


S1M10000047A11 


SAU800942 


PI TvyTI 00OO99 *W05 
E, 1 xVl 1UU UUZZ J JCIUJ 


FPO 103231 


S1M10000003D08 

i ITX X WWw w J—^ w w 


SAU802262 


S1M10000047B11


SAU802090 


F 1M1 000099 "*T-H 1 


FPO 102802 


S1M10000003A09 

J. 1TX X WW W W W«p/^ ».w 


SAU800700 


S1M10000047C11 


SAU801434 


F 1 Ml 0000224F05 


ECO103076


S1M10000003B09 


SAU800824 


S1M10000047E11 


SAU801253 


E1M10000225G01


ECO103641


S1M10000003E09 


SAU800548 


S1M10000047F11 


SAU802226 


E1M10000225A02


ECO103881


S1M10000003A10 


SAU800539 


S1M10000047H11 


SAU800753 


E1M10000225A02 


ECO103882 


S1M10000003C10


SAU800548 


S1M10000047A12 


SAU800537 


E1M10000225C02


ECO103785


S1M10000003D10


SAU800122 


S1M10000047B12 


SAU801263 


F1M10O00225F02 


ECO100008


S1M10000003E10


SAU80150C 


S1M10000047C12


SAU801263 


F1M10000225F02 


ECO100009


S1M10000003A11 


SAU801113 


S1M10000047D12 


SAU801831 


E1M10000225F03 


ECO100435 


S1M10000003E11 


SAU80054J 


S1M10000047E12


SAU800543 


E1M10000225H03 


ECO101066 


S1M10000003B12


t SAU80143^ 


S1M10000047F12


SAU802223


E1M10000225F04 


ECO101583 


S1M10000003B12 


! SAU30289: 


S1M10000048C01


SAU800753 


E1M10OO0225A0C 


i ECO 1032 IS 


, S1M10000003C1: 


I SAU80054^ 


S1M10000048D01


SAU802654 


E1M10000225B0* 


i ECO10125S 


S1M10000003F12


> SAU801621 


S1M10000048G01 


SAU800363 


El M 1 0000225B07 ECO 1 036U"i 


r S1M10000004C0 


i SAU80265! 


S1M10000048H01


SAU801740 
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Clone Name 

E1M10000225E08 
E1M10000225E09 
E1M10000225H09 
E1M10000225H09 
E1M10000225F10 

ci \a i AAA099^m 9 
blMlUUUUZZjUiz 

E1M10000225D12 
E1M10000225G12 
ElM10000226fcOi 
E1M10000226B02 
E1M10000226B02 


Gene 
LocusID 
ECO102057 
ECO102878 
ECO101953 
ECO101989 
ECO101684 
ECO102966 
ECO102967
ECO101128
ECO100469
ECO100683
ECO103405 


Clone Name 

S1M10000004D01 
S1M10000004D01 
S1M10000004F01 
S1M10000004G01 
S1M10000004C02 
" S1M10000004F02 
S1M10000004B03 
S1M10000004B03 
S1M10000004D03 
S1M10000004E03 
S1M10000004G03


Gene 
LocusID 
SAU801616 
SAU801615 
SAU800966
SAU802496 
SAU802496 
SAU800546
SAU800257 
SAU800258 
SAU800187 
SAU800153 
SAU800016 


Clone Name 

S1M10000048A02 

S1M10000048B02 

"S1M10000048C02 

S1M10000048D02 

S1M10000048D02

S1M10000048E02

S1M10000048F02

S1M10000048G02

S1M10000048G02 

S1M10000048H02

S1M10000048A03


Gene 
LocusID 
SAU800006 
SAU801740 
SAU800490 
SAU802548 
SAU104011 
SAU800753 
SAU802107 
SAU800019 
SAU800020 
SAU800547 
SAU801630 


E1M10000226B02 
E1M10000226B02 
E1M10000226C02 
E1M10000226F02
E1M10000226D03 
E1M10000226G03 

blM lUUUUZZOrUH- 

T7 1 » * 1 rin a AO o£tro A 1 
E 1 M 1 UUUUZZOrlu** 

T7 1 A A 1 ft A A AO T /?UA A 

E 1 M 1 UUUUZZOriuo 

nn/i aaaA77£ AOS 
b 1 M 1 UUUUZZOAUO 

Tj i A>fi AAAA99^nOR 
bIMlUUUUZZOl-'UO 

■c i 1 aa A099 £nOQ 

b 1M lUUUUZZO.L'U^ 


ECO103515 
ECO204900 
ECO103030
ECO102999 
ECO103160 
ECO103696 
FCO 102508 
FHOI 00713 
pro 103848 
FCO104132 
FCO101753 
FCO 100430 


S1M10000004A04 
S1M10000004B04 
S1M10000004D04
S1M10000004D04 
S1M10000004E04 
S1M10000004UO!) 
S1M10000004A06 
S1M10000004C06 
S1M10000004D06 
S1M10000004E06 
S1M10000004F06 
S1M10000004A07 


SAU802655 
SAU801760 
SAU802224 
SAU802223 
SAU802247 
SAU800548 
SAU801644 
SAU802714 
SAU800006 
SAU802240 
SAU800152 
SAU802503 


S1M10000048B03 

S1M10000048C03

S1M10000048C03

S1M10000048E03 

S1M10000048E03 

S1M10000048F03 

S1M10000048G03 

S1M10000048H03 

S1M10000048E04 

S1M10000048G04 

S1M10000048H04 

S1M10000048H04 


SAU801184 
SAU802586 
SAU802585 
SAU802586 
SAU802585 
SAU802226 
SAU800543 
SAU802217 
SAU801183 
SAU802247 
SAU802586 
SAU802585 


t; i x/Ti oooo99£nOQ 

b 1 JVl I UU UUZZ OJLJ u7 


ECO100431


S1M10000004D07 


SAU802496 


S1M10000048A05


SAU801263 


T7 1 1UT 1 00009 9 /VR 1 n 


ECO102113 


S1M10000004E07 


SAU802177 


S1M10000048B05 


SAU800753 


T7iA/noooo996Tji 0 

b 1M lUUUUZZOL/ 1 u 


FCO 101 623 


S1M10000004E07 


SAU802176 


S1M10000048C05 


SAU801891 


t? 1 x>f i 000099 fiF i n 

E» L IVl 1 UUUvZZUC 1 \J 


FCO 1027 14 


S1M10000004F07 


SAU801683 


S1M10000048F05 


SAU801184 


v\\a ioooo99fim \ 


ECO103244


S1M10000004G07 


SAU801644 


S1M10000048G05 


SAU800542 


P 1 \A 1 000099 fiR 1 9 
GllVllUUUVJZZuD 1 Z> 


ECO101916 


S1M10000004G07 


SAU801643


S1M10000048H05 


SAU800546 


Fi^yfi 0000996FI 9 

C L IVilUvJUUZZvJr LZ. 


ECO100240


S1M10000004B08 


SAU801346 


S1M10000048A06 


SAU800546 


n i \a 1 000099 7FO^ 

ti I M 1 UU UUZZ / E/VJ 


FCO 10097 5 


S1M10000004B08 


SAU200535 


S1M10000048B06 


SAU801184 


f 1 \a i oooo997FO' : t 

H 1 JVl 1 UUUUZZ / CUJ 


ECO201249 


S1M10000004C08 


SAU802503 


S1M10000048C06 


SAU801670 


F1\yf1 000099 7nm 


ECO101458


S1M10000004D08 


SAU800517 


S1M10000048C06 


SAU801669 


T7 1 A/l 1 000099 7G0^ 


FCO101459 


S1M10000004D08 


SAU202623 


S1M10000048E06 


SAU801186 


F1 Tv/f 1 00009 9 7F04 


ECO104148 


S1M10000004F08 


SAU802224 


S1M10000048A07


SAU801253 


V\\A\ 000099 7F04 


ECO104149 


S1M10000004F08 


SAU802223 


S1M10000048C07 


SAU800018 


F1 \A\ 000099 7H0A 
c 1 IVl 1 uuuuzz / nun 


FCO 100844 


S1M10000004B09 


SAU802177 


S1M10000048E07 


SAU801919 


F1\zI10000997C0S 


ECO104164


S1M10000004B09 


SAU802176 


S1M10000048F07 


SAU801831 


F 1 \A 100009 9 7C0S 


FCO 104 165 


S1M10000004C09 


SAU802496 


S1M10000048G07 


SAU800520 


F1M1 0000997P07 


ECO101423 


S1M10000004F09 


SAU802496 


S1M10000048G07 


SAU800519 


F1 M1 000099 7G07 

JD 1 IVl 1 uwUU i. ^ / VJ V/ / 


ECO103885 


S1M10000004G09 


SAU801264 


S1M10000048H07 


SAU800546 


F 1 "M 1 0000227B08 


ECO102736 


> S1M10000004C10 


SAU801742 


S1M10000048B08 


SAU800018 


F1M10000227B08 


ECO 10273' 


' S1M10000004CK 


> SAU801741 


S1M10000048C08 


SAU800453 


F1M10000227D08 


ECO103911 


S1M10000004D1C 


) SAU80161S 


1 S1M10000048D08 


SAU300619 


F 1 M 1 0000227BOS 


ECO10273( 


> S1M10000004AU 


SAU801475 


i S1M10000048E08 


SAU802224 


F1M10000227B0^ 


> ECO 10273' 


1 S1M10000004B11 


SAU80111: 


i S1M10000048F0S 


> SAU800546 


F1M10000227D0S 


) ECO1039U 


[ S1M10000004C11 


SAU80070C 


) S1M10000048HOJ 


» SAU802244 


E1M10000227E11 


ECO10372! 


\ SlM10000004Ali 


I SAU80120^ 


I S1M10000048A0S 


) SAU800547 


E1M10000227E11 


ECO 10372^ 


\ S1M10000004C1: 


I SAU80051S 


) S1M10000048C0S 


) SAU800548 


E1M10000227C12 


I ECO 10039' 


1 S1M10000004D1: 


I SAU80118: 


i S1M10000048DOS 


) SAU802244 


E1M10000227CI: 


I ECO 10039! 


5 S1M10000004D1: 


I SAU80118' 


\ S1M10000048EOS 


) SAU802090 


E1M10000227D1: 


I ECO10414^ 


* S1M10000004E1: 


I SAU80052I 


I S1M10000048F0< 


) SAU802238 


E1M10000232H0: 


I ECO10323, 


5 S1M10000004F1: 


I SAU80190( 


) S1M10000048HO! 


) SAU800546 
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Clone Name 



Gene 
LocusID 



Clone Name 



Gene 
LocusID 



Clone Name 



Gene 
LocusID 



E1M10000232H02 



ECO103236 



S1M10000004G12 



SAU801342 



S1M10000048A10 



E1M10000232A03 



ECO103885 



S1M10000005A01 



SAU802496 



S1M10000048B10 



SAU800547 



E1M10000232B03 



ECO101324 



E1M10000232H03 



ECO103097



S1M10000005C01
S1M10000005E01 



SAU802496 



S1M10000048C10 



SAU800367 



E1M10000232C07 



ECO100170 



S1M10000005B02



E1M10000232F07 



ECO103797



S1M10000005D02 



SAU800996 



SAU802243 



SAU800519 



S1M10000048D10 



SAU802590 



S1M10000048E10 



SAU802590 



S1M10000048G10 



SAU802238 



E1M10000232F07 



ECO103798 



S1M10000005E02 



SAU802655 



S1M10000048H10 



SAU802240 



E1M10000232G07 



E1M10000232A08



E1M10000232G08 



E1M10000232G12 



ECO104010 



ECO100850 



S1M10000005F02
S1M10000005F02 



SAU801644 



S1M10000048A11 



SAU801643 



ECO100875



S1M10000005A03 



SAU802310 



S1M10000048C11
S1M10000048D11 



SAU802224 



SAU802217 



SAU802090 



ECO102636 



S1M10000005D03 



SAU800548 



S1M10000048F11 
S1M10000048G11 



SAU802496 



E1M10000233C01 



E1M10000233A03 



E1M10000233B03 



E1M10000233D03 



E1M10000233H03 



ECO103886 
ECO100784



S1M10000005F03 



SAU802262 



ECO100784



ECO100118 



ECO103238



S1M10000005B04 



SAU801183 



S1M10000048H11 



SAU801186
SAU801139 



S1M10000005D04 



SAU801183 



S1M10000048A12 



SAU801139 



S1M10000005D04 



SAU801184 



S1M10000048B12 



SAU802502 



S1M10000005F04 



SAU800363 



S1M10000048D12 
S1M10000048D12 



SAU800250 
SAU800249 



E1M10000233H03 



ECO103239



S1M10000005F04 



SAU800362 



E1M10000233C04 



ECO102309



S1M10000005F04 



SAU800361 



S1M10000048G12 



E1M10000233G04 



ECO101185 



S1M10000005C05



SAU801264 



E1M10000233A05 



ECO102553 



S1M10000005D05 



SAU801644 
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Table IC provides a cross reference between PathoSeq Gene Loci listed in Table IB and the 
SEQ ID NOs. of the corresponding PathoSeq polypeptides and the SEQ ID NOs. of the nucleic
acids which encode them. The Gene Locus IDs provided in Table IC each comprise a nine-character
alphanumeric identifier that can be used to determine the organism from which each Gene Locus
and corresponding SEQ ID NOs. were identified. Specifically, the first letter of the Gene Locus ID
corresponds to the first letter of the genus name of the organism described herein from which the
Gene Locus was identified, and the second and third letters of the Gene Locus ID correspond to the
first two letters of the species name of this organism. For example, the identifier EFA205257
describes a gene locus identified from Enterococcus faecalis. In those instances where the
three-letter identifier is the same for different organisms, the exact identity of the organism which
corresponds to the Gene Locus ID can be determined by referring to the organism designation in the
sequence listing for the coding nucleic acid or polypeptide SEQ ID NO. that corresponds to the
particular Gene Locus ID.
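
The naming rule described above can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the function names and the organism list are hypothetical and are not part of the disclosure, and the sketch implements only the rule as worded (genus initial plus the first two letters of the species name, followed by six digits). Any prefix that does not follow that rule would need an explicit lookup, and a shared prefix still has to be resolved against the organism designation in the sequence listing.

```python
import re

# Three upper-case letters followed by six digits, e.g. "EFA205257".
LOCUS_ID = re.compile(r"^([A-Z]{3})(\d{6})$")


def prefix_for(genus, species):
    """Build the three-letter prefix: genus initial + first two species letters."""
    return (genus[0] + species[:2]).upper()


def parse_locus_id(locus_id):
    """Split a Gene Locus ID into (prefix, number); raise ValueError if malformed."""
    match = LOCUS_ID.match(locus_id.strip())
    if match is None:
        raise ValueError(f"not a nine-character Gene Locus ID: {locus_id!r}")
    return match.group(1), match.group(2)


def candidate_organisms(locus_id, organisms):
    """Return every organism in `organisms` whose derived prefix matches the ID."""
    prefix, _ = parse_locus_id(locus_id)
    return [f"{g} {s}" for g, s in organisms if prefix_for(g, s) == prefix]


# Hypothetical organism list, for illustration only. The second entry is
# included solely to show two organisms sharing one prefix.
ORGANISMS = [
    ("Enterococcus", "faecalis"),
    ("Enterococcus", "faecium"),
    ("Pseudomonas", "aeruginosa"),
]

print(parse_locus_id("EFA205257"))
print(candidate_organisms("PAE203653", ORGANISMS))
print(candidate_organisms("EFA205257", ORGANISMS))
```

When more than one candidate is returned, as for the EFA prefix in this hypothetical list, the identifier alone is ambiguous and the organism designation in the sequence listing decides.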
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TABLE IC 



DNA SeqID   Protein SeqID   Gene LocusID   DNA SeqID   Protein SeqID   Gene LocusID   DNA SeqID   Protein SeqID   Gene LocusID
6214        42398           EFA205257      18276       54460           CJU100846      30337       66521           PAE203653
6215        42399           EFA205258      18277       54461           CJU100848      30338       66522           PAE203654
6216        42400           EFA205225      18278       54462           CJU100855      30339       66523           PAE203656
6217        42401           EFA201977      18279       54463           CJU100856      30340       66524           PAE203658
6218        42402           EFA203137      18280       54464           CJU100859      30341       66525           PAE203668
6219        42403           EFA200840      18281       54465           CJU100860      30342       66526           PAE203670
6220        42404           EFA202003      18282       54466           CJU100861      30343       66527           PAE203672
6221        42405           EFA200807      18283       54467           CJU100862      30344       66528           PAE203677
6222        42406           EFA200811      18284       54468           CJU100863      30345       66529           PAE203684
6223        42407           EFA201987      18285       54469           CJU100866      30346       66530           PAE203691
6224        42408           EFA201980      18286       54470           CJU100870      30347       66531           PAE203698
6225        42409           EFA201981      18287       54471           CJU100871      30348       66532           PAE203722
6226        42410           EFA205229      18288       54472           CJU100872      30349       66533           PAE203732
6227        42411           EFA201028      18289       54473           CJU100885      30350       66534           PAE203735
6228        42412           EFA201993      18290       54474           CJU100886      30351       66535           PAE203739
6229        42413           EFA201974      18291       54475           CJU100888      30352       66536           PAE203740
6230        42414           EFA201975      18292       54476           CJU100890      30353       66537           PAE203741
6231        42415           EFA202001      18293       54477           CJU100891      30354       66538           PAE203742
6232        42416           EFA200839      18294       54478           CJU100896      30355       66539           PAE203743
6233        42417           EFA201985      18295       54479           CJU100903      30356       66540           PAE203744
6234        42418           EFA201984      18296       54480           CJU100923      30357       66541           PAE203751
6235        42419           EFA202953      18297       54481           CJU100925      30358       66542           PAE203754
6236        42420           EFA202022      18298       54482           CJU100929      30359       66543           PAE203755
6237        42421           EFA202028      18299       54483           CJU100938      30360       66544           PAE203757
6238        42422           EFA202536      18300       54484           CJU100944      30361       66545           PAE203758
6239        42423           EFA200412      18301       54485           CJU100951      30362       66546           PAE203760


6240 


42424 


EFA201999 


! 18302 


54486 


CJU100955 


30363 


66547 


PAE203766 


6241 


42425 


EFA201997 


| 18303 


54487 


CJU100961 


30364 


66548 


PAE203774 


6242 


42426 


EFA200624 


I8304 


54488 


CJU100965 


30365 


66549 


PAE203796 


6243 


42427 


EFA201983 


18305 


54489 


CJU100966 


30366 


66550 


PAE203797 


6244 


42428 


EFA200812 


18306 


54490 


CJUl 00967 


30367 


66551 


PAE203799 


6245 


42429 


EFA200660 


18307 


54491 


CJU100970 


30368 


66552 


PAE203800 


6246 


42430 


EFA200661 


18308 


54492 


CJU100980 


30369 


66553 


PAE203801 


6247 


42431 


EFA202276 


: 18309 


54493 


CJU100982 


30370 


66554 


PAE203810 


6248 


42432 


EFA201982 


18310 


54494 


CJU100983 


30371 


66555 


PAE203812 


6249 


42433 


EFA202214 


i I83ll 


54495 


CJU100987 


30372 


66556 


PAE203813 


6250 


42434 


EFA202216 


18312 


54496 


CJU100989 


30373 


66557 


PAE203820 


6251 


42435 


EFA200360 


18313 


54497 


CJU100993 


30374 


66558 


PAE203821 


6252 


42436 


EFA200766 


18314 


54498 


CJU100996 


30375 


66559 


PAE203828 


6253 


42437 


EFA200805 


| 18315 


54499 


CJU100997 


30376 


66560 


PAE203829 


6254 


42438 


EFA201637 


18316 


54500 


CJU100998 


30377 


66561 


PAE203831 


6255 


42439 


EFA201986 


18317 


54501 


CJU100999 


30378 


66562 


PAE203850 


6256 


42440 


EFA205255 


18318 


54502 


CJU101006 


30379 


66563 


PAE203869 


6257 


42441 


EFA200495 


18319 


54503 


CJU101007 


30380 


66564 


PAE203871 


6258 


42442 


EFAl 04836 


18320 


54504 


CJUlOlOll 


30381 


66565 


PAE203872 


6259 


42443 


EFA201976 


18321 


54505 


CJU101017 


30382 


66566 


PAE203887 


6260 


42444 


EFA201523 


18322 


54506 


CJU101021 


30383 


66567 


PAE203888 


6261 


42445 


EFA202012 


18323 


54507 


CJU101022 


30384 


66568 


• PAE203892 


6262 


42446 


EFA202007 


18324 


54508 


CJU101027 


30385 


66569 


PAE203897 


6263 


42447 


EFA20O3O7 


18325 


54509 


CJU101028 


30386 


66570 


PAE203898 


6264 


42448 


EFA201888 


18326 


545 10 


CJU101029 


30387 


6657I 


PAE203900 


6265 


42449 


EFA205285 


18327 


545 ll 


CJU101031 


30388 


66572 


PAE203911 
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PNA 


Protein 


Gene LocusED 


SeqID 


SeqID 




6266 


42450 


EFA200842 


6267 


42451 


EFA201204 


6268 


42452 


T^T"» A f\ i\ •» S~ ^ 1 

EFA201611 


6269 


42453 


EFA201946 


6270 


42454 


EFA201512 


6271 


42455 


EFA205288 


6272 


42456 


EFA203140 


6273 


42457 


EFA200544 


6274 


42458 


EFA203904 


6275 


42459 


EFA202006 


6276 


42460 


EFA202298 


6277 


42461 


EFA200326 


6278 


42462 


EFA200662 


6279 


42463 


EFA202217 


6280 


42464 


EFA201869 


6281 


42465 


EFA203598 


6282 


42466 


EFA200894 


6283 


42467 


EFA200613 


6284 


42468 


EFA202166 


6285 


42469 


EFA202168 


6286 


42470 


EFA201087 


6287 


42471 


EFA201084 


6288 


42472 


EFA201883 


6289 


42473 


EFA201905 


6290 


42474 


EFA202015 


6291 


42475 


EFA202274 


6292 


42476 


EFA200916 


6293 


42477 


EFA200179 


6294 


42478 


EFA200358 


6295 


42479 


EFA200747 


6296 


42480 


EFA200746 


6297 


42481 


EFA201379 


6298 


42482 


EFA201163 


6299 


42483 


EFA202160 


6300 


42484 


EFA200457 


6301 


42485 


EFA202177 


6302 


42486 


EFA202176 


6303 


42487 


EFA200562 


6304 


42488 


EFA200192 


6305 


42489 


EFA202180 


6306 


42490 


EFA201749 


6307 


42491 


EFA203429 


6308 


42492 


EFA202378 


6309 


42493 


EFA202013 


6310 


42494 


EFA202200 


6311 


42495 


EFA203606 


6312 


42496 
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47665 


BCE105992 


23543 


59727 


KPN302511 


35604 


71788 

III OO 


SHA101725 


11482 


47666 


BCE105994 

JU/ V-/ i_J * V/ S S 


23544 


59728 


KPN302512 


35605 


71789 


RHA101737 


11483 


47667 


BCE106023 


23545 


59729 


KPN302515 


35606 


71790 


SHA101738 


11484 


47668 


BCE106024 


23546 


59730 


KPN302517 


35607 


71791 


SHA101739 


11485 


47669 


BCE106056 


23547 


59731 


KPN302520 


35608 


71792 


SHA101741 


11486 


47670 


BCE106065 


23548 


59732 


KPN302522 


35609 


71793 


SHA101742 


11487 


47671 


BCE106082 


23549 


59733 


KPN302526 


35610 


71794 


SHA101743 


11488 


47672 


BCE106087 


23550 


59734 


KPN302527 


35611 


71795 


SHA101749 


11489 


47673 


BCE106124 


23551 


59735 


KPN302540 


35612 


71796 


SHA101753 


11490 


47674 


BCE106127 


23552 


59736 
389 


KPN302541 


35613 


71797 


SHA101761 
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SeqID 




1 1 Af\t 

11491 


47675 


oCE106131 


1 i a ai 

11492 


47676 


BCE106163 


11493 




BCE106172 


1 1 Af\A ' 

11494 


47678 


BCE106179 


i 1 a nc 

11495 




BCE106184 


1 1 AC\£ 

11496 


47680 


BCE106187 


1 1 A A*7 

11497 


47ool 


BCE1 06205 


11498 


47682 


Tiz^i?! Arm o 

BCE106238 


1 1 A AA 

11499 


47683 


BCEl 06244 


1 1 CAA 

11500 


A *7£OA 

47684 


BCEl 06246 


11501 


47685 


BCE106256 


11502 


47686 


BCEl 06262 


11503 


47687 


BCEl 06326 


11504 


47688 


BCE106333 


11505 


47689 


BCE106339 


11506 


47690 


BCE106341 


1 1507 


47691 


BCEl 06343 


1 1 CAO 

11508 


A OiCAO 

4/692 


BCE106345 


1 1 C AA 

1 1509 


4/693 


BCEl 06359 


1 1 CI A 

11510 


A *7/Zf\A 

47694 


BCEl 06362 


lien 
11511 


47695 


BCE106431 


11512 


47696 


BCE106437 


11513 


47697 


BCE106451 


11514 


47698 


BCEl 06489 


11515 


47699 


BCE106496 


11516 


47700 


BCE106500 


11517 


47701 


BCE106510 


11518 


47702 


BCEl 065 11 


11519 


47703 


BCE106540 


11520 


47704 


BCE106577 


11521 


47705 


BCE106581 


11522 


47706 


BCEl 06620 


11523 


47707 


BCE106651 


11524 


47708 


BCE106664 


11525 


47709 


BCE106666 


11526 


47710 


BCE106677 


11527 


47711 


BCE106700 


1 1 oo 

11528 


47712 


BCE106716 


1 1 CIA 

11529 


47713 


BCE106766 


11530 


ATll A 


BCEl 06802 


1 1 CI 1 

1 1531 


ATJ1 c 

47715 


BCE106818 


11532 


47716 


BCEl 06878 


11533 


47717 


BCEl 06900 


11534 


47718 


BCE106919 


11535 


47719 


BCEl 06925 


11536 


47720 


BCE106939 


11537 


47721 


BCE106941 


1 1 c*> o 

11538 


47722 


BCE106968 


11539 


47723 


BCE106981 


11540 


47724 


BCEl 07000 


11541 


47725 


BCE107002 


11542 


47726 


BCE107003 


11543 


47727 


BCE107013 


11544 


47728 


BCE107019 


11545 


AU29 


BCE107035 



DNA SeqID  Protein SeqID  Gene LocusID
23553      59737          [illegible]
23554      59738          [illegible]
23555      59739          [illegible]
23556      59740          [illegible]
23557      59741          [illegible]
23558      59742          [illegible]
23559      59743          [illegible]
23560      59744          [illegible]
23561      59745          [illegible]
23562      59746          [illegible]
23563      59747          [illegible]
23564      59748          [illegible]
23565      59749          KPN302590
23566      59750          KPN302592
23567      59751          [illegible]
23568      59752          [illegible]
23569      59753          [illegible]
23570      59754          [illegible]
23571      59755          [illegible]
23572      59756          [illegible]
23573      59757          [illegible]
23574      59758          [illegible]
23575      59759          [illegible]
23576      59760          KPN302625
23577      59761          KPN302628
23578      59762          KPN302635
23579      59763          KPN302648
23580      59764          [illegible]
23581      59765          [illegible]
23582      59766          [illegible]
23583      59767          [illegible]
23584      59768          KPN302676
23585      59769          [illegible]
23586      59770          [illegible]
23587      59771          KPN302688
23588      59772          KPN302693
23589      59773          KPN302701
23590      59774          [illegible]
23591      59775          [illegible]
23592      59776          [illegible]
23593      59777          [illegible]
23594      59778          KPN302739
23595      59779          KPN302744
23596      59780          KPN302747
23597      59781          KPN302749
23598      59782          KPN302752
23599      59783          [illegible]
23600      59784          [illegible]
23601      59785          [illegible]
23602      59786          KPN302783
23603      59787          KPN302790
23604      59788          KPN302795
23605      59789          KPN302797
23606      59790          KPN302800
23607      59791          KPN302804









PCT/US02/09107 



DNA SeqID  Protein SeqID  Gene LocusID
35614      71798          SHA101762
35615      71799          SHA101765
35616      71800          SHA101766
35617      71801          SHA101771
35618      71802          SHA101775
35619      71803          SHA101777
35620      71804          SHA101788
35621      71805          SHA101790
35622      71806          SHA101796
35623      71807          SHA101810
35624      71808          SHA101818
35625      71809          SHA101827
35626      71810          SHA101832
35627      71811          SHA101835
35628      71812          SHA101837
35629      71813          SHA101838
35630      71814          SHA101846
35631      71815          SHA101847
35632      71816          SHA101854
35633      71817          SHA101855
35634      71818          SHA101864
35635      71819          SHA101869
35636      71820          SHA101870
35637      71821          SHA101871
35638      71822          SHA101874
35639      71823          SHA101878
35640      71824          SHA101896
35641      71825          SHA101897
35642      71826          SHA101900
35643      71827          SHA101901
35644      71828          SHA101903
35645      71829          SHA101905
35646      71830
35647      71831          SHA101916
35648      71832          SHA101917
35649      71833          SHA101918
35650      71834          SHA101919
35651      71835          SHA101930
35652      71836          SHA101932
35653      71837          SHA101939
35654      71838          SHA101943
35655      71839          SHA101955
35656      71840          SHA101961
35657      71841          SHA101965
35658      71842          SHA101967
35659      71843          SHA101969
35660      71844          SHA101974
35661      71845          SHA101980
35662      71846          SHA101983
35663      71847          SHA101990
35664      71848          SHA101993
35665      71849          SHA101994
35666      71850          SHA102001
35667      71851          SHA102002
35668      71852          SHA102003



WO 02/077183 




DNA SeqID  Protein SeqID  Gene LocusID
11546      47730          BCE107041
11547      47731          BCE107050
11548      47732          BCE107064
11549      47733          BCE107071
11550      47734          BCE107083
11551      47735          BCE107096
11552      47736          BCE107115
11553      47737          BCE107126
11554      47738          BCE107136
11555      47739          BCE107142
11556      47740          BCE107211
11557      47741          BCE107218
11558      47742          BCE107273
11559      47743          BCE107277
11560      47744          BCE107288
11561      47745          BCE107290
11562      47746          BCE107302
11563      47747          BCE107311
11564      47748          BCE107325
11565      47749          BCE107342
11566      47750          BCE107345
11567      47751          BCE107353
11568      47752          BCE107363
11569      47753          BCE107366
11570      47754          BCE107415
11571      47755          BCE107416
11572      47756          BCE107431
11573      47757          BCE107448
11574      47758          BCE107483
11575      47759          BCE107484
11576      47760          BCE107495
11577      47761          BCE107496
11578      47762          BCE107511
11579      47763          BCE107517
11580      47764          BCE107531
11581      47765          BCE107538
11582      47766          BCE107571
11583      47767          BCE107575
11584      47768          BCE107576
11585      47769          BCE107587
11586      47770          BCE107605
11587      47771          BCE107623
11588      47772          BCE107626
11589      47773          BCE107637
11590      47774          BCE107642
11591      47775          BCE107667
11592      47776          BCE107676
11593      47777          BCE107682
11594      47778          BCE107689
11595      47779          BCE107697
11596      47780          BCE107702
11597      47781          BCE107711
11598      47782          BCE107747
11599      47783          BCE107754
11600      47784          BCE107762



DNA SeqID  Protein SeqID  Gene LocusID
23608      59792          KPN302806
23609      59793          KPN302808
23610      59794          KPN302809
23611      59795          KPN302810
23612      59796          KPN302817
23613      59797          KPN302822
23614      59798          KPN302831
23615      59799          KPN302833
23616      59800          KPN302836
23617      59801          KPN302840
23618      59802          KPN302841
23619      59803          KPN302863
23620      59804          KPN302864
23621      59805          KPN302865
23622      59806          KPN302879
23623      59807          KPN302890
23624      59808          KPN302908
23625      59809          KPN302911
23626      59810          KPN302916
23627      59811          KPN302924
23628      59812          KPN302939
23629      59813          KPN302944
23630      59814          KPN302949
23631      59815          KPN302952
23632      59816          KPN302957
23633      59817          KPN302967
23634      59818          KPN302968
23635      59819          KPN302981
23636      59820          KPN302997
23637      59821          KPN303003
23638      59822          KPN303005
23639      59823          KPN303031
23640      59824          KPN303040
23641      59825          KPN303047
23642      59826          KPN303050
23643      59827          KPN303051
23644      59828          KPN303054
23645      59829          KPN303056
23646      59830          KPN303058
23647      59831          KPN303060
23648      59832          KPN303068
23649      59833          KPN303070
23650      59834          KPN303072
23651      59835          KPN303075
23652      59836          KPN303078
23653      59837          KPN303095
23654      59838          KPN303098
23655      59839          KPN303099
23656      59840          KPN303110
23657      59841          KPN303112
23658      59842          KPN303124
23659      59843          KPN303125
23660      59844          KPN303133
23661      59845          KPN303134
23662      59846          KPN303135







DNA SeqID  Protein SeqID  Gene LocusID
35669      71853          SHA102006
35670      71854          SHA102007
35671      71855          SHA102011
35672      71856          SHA102018
35673      71857          SHA102019
35674      71858          SHA102021
35675      71859          SHA102022
35676      71860          SHA102023
35677      71861          SHA102024
35678      71862          SHA102028
35679      71863          SHA102031
35680      71864          SHA102033
35681      71865          SHA102036
35682      71866          SHA102040
35683      71867          SHA102044
35684      71868          SHA102047
35685      71869          SHA102048
35686      71870          SHA102049
35687      71871          SHA102072
35688      71872          SHA102078
35689      71873          SHA102088
35690      71874          SHA102089
35691      71875          SHA102097
35692      71876          SHA102099
35693      71877          SHA102124
35694      71878          SHA102128
35695      71879          SHA102129
35696      71880          SHA102130
35697      71881          SHA102132
35698      71882          SHA102134
35699      71883          SHA102135
35700      71884          SHA102145
35701      71885          SHA102148
35702      71886          SHA102150
35703      71887          SHA102153
35704      71888          SHA102155
35705      71889          SHA102166
35706      71890          SHA102188
35707      71891          SHA102191
35708      71892          SHA102193
35709      71893          SHA102199
35710      71894          SHA102201
35711      71895          SHA102213
35712      71896          SHA102214
35713      71897          SHA102216
35714      71898          SHA102217
35715      71899          SHA102218
35716      71900          SHA102220
35717      71901          SHA102221
35718      71902          SHA102223
35719      71903          SHA102232
35720      71904          SHA102241
35721      71905          SHA102245
35722      71906          SHA102248
35723      71907          SHA102255



DNA SeqID  Protein SeqID  Gene LocusID
11601      47785          BCE107805
11602      47786          BCE107836
11603      47787          BCE107845
11604      47788          BCE107846
11605      47789          BCE107847
11606      47790          BCE107854
11607      47791          BCE107881
11608      47792          BCE107896
11609      47793          BCE107905
11610      47794          BCE107987
11611      47795          BCE107990
11612      47796          BCE107994
11613      47797          BCE108020
11614      47798          BCE108025
11615      47799          BCE108028
11616      47800          BCE108051
11617      47801          BCE108053
11618      47802          BCE108067
11619      47803          BCE108070
11620      47804          BCE108074
11621      47805          BCE108109
11622      47806          BCE108121
11623      47807          BCE108130
11624      47808          BCE108135
11625      47809          BCE108192
11626      47810          BCE108240
11627      47811          BCE108241
11628      47812          BCE108256
11629      47813          BCE108257
11630      47814          BCE108299
11631      47815          BCE108358
11632      47816          BCE108394
11633      47817          BCE108487
11634      47818          BCE108504
11635      47819          BCE108518
11636      47820          BCE108533
11637      47821          BCE108544
11638      47822          BCE108546
11639      47823          BCE108622
11640      47824          BCE108649
11641      47825          BCE108668
11642      47826          BCE108675
11643      47827          BCE108686
11644      47828          BCE108691
11645      47829          BCE108701
11646      47830          BCE108719
11647      47831          BCE108732
11648      47832          BCE108743
11649      47833          BCE108751
11650      47834          BCE108786
11651      47835          BCE108792
11652      47836          BCE108815
11653      47837          BCE108826
11654      47838          BCE108874
11655      47839          BCE108880



DNA SeqID  Protein SeqID  Gene LocusID
23663      59847          KPN303137
23664      59848          KPN303138
23665      59849          KPN303140

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It will be appreciated that ORFs may also be identified using databases other than 
PathoSeq. For example, the ORFs may be identified using the methods described in U.S. 
Provisional Patent Application Serial Number 60/191,078, filed March 21, 2000. 

EXAMPLE 4 

Transfer of Exogenous Nucleic Acid Sequences to Other Bacterial Species

The ability of an antisense molecule identified in a first organism to inhibit the
proliferation of a second organism (thereby confirming that a gene in the second organism which is
homologous to the gene from the first organism is required for proliferation of the second
organism) was validated using antisense nucleic acids which inhibit the growth of E. coli and which
were identified using methods similar to those described above. Expression vectors which inhibited
growth of E. coli upon induction of antisense RNA expression with IPTG were transformed directly
into Enterobacter cloacae, Klebsiella pneumoniae or Salmonella typhimurium. The transformed
cells were then assayed for growth inhibition according to the method of Example 1. After growth
in liquid culture, cells were plated at various serial dilutions and a score was determined by
calculating the log difference in growth for INDUCED vs. UNINDUCED antisense RNA expression, as
determined by the maximum 10-fold dilution at which a colony was observed. The results of these
experiments are listed below in Table II. If there was no effect of antisense RNA expression in a
microorganism, the clone is scored as minus in Table II. In contrast, a positive in Table II means
that at least 10-fold more cells were required to observe a colony on the induced plate than on the
non-induced plate under the conditions used and in that microorganism.
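The scoring rule just described can be illustrated with a short Python sketch. It is not the procedure of Example 1 itself: the function names and the dictionary layout (dilution exponent mapped to colony count, where exponent k stands for a 10^-k dilution) are assumptions made for illustration only.

```python
from typing import Mapping


def max_dilution_with_colony(colonies_by_dilution: Mapping[int, int]) -> int:
    """Return the largest 10-fold dilution exponent at which a colony grew.

    Keys are dilution exponents (k means a 10**-k dilution); values are
    colony counts observed on the plate for that dilution.
    """
    observed = [k for k, count in colonies_by_dilution.items() if count > 0]
    if not observed:
        raise ValueError("no colonies observed at any dilution")
    return max(observed)


def growth_score(uninduced: Mapping[int, int], induced: Mapping[int, int]) -> int:
    """Log10 difference in growth, UNINDUCED minus INDUCED."""
    return max_dilution_with_colony(uninduced) - max_dilution_with_colony(induced)


def table_entry(score: int) -> str:
    """'+' when at least 10-fold more cells were needed on the induced plate."""
    return "+" if score >= 1 else "-"
```

For example, if colonies appear down to the 10^-4 dilution without induction but only down to 10^-2 with induction, `growth_score` gives 2 and the clone would be entered as "+".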

TABLE II

Mol. No.   S. typhimurium   E. cloacae   K. pneumoniae
EcXA001    +                +
EcXA004    +
EcXA005    +                +            +
EcXA006
EcXA007                     +
EcXA008                                  +
EcXA009
EcXA010                     +            +
EcXA011                     +
EcXA012                     +
EcXA013                     +            +
EcXA014    +                +
EcXA015    +                +
EcXA016    +                +            +
EcXA017                     +            +
EcXA018    +                +            +
EcXA019                     +            +
EcXA020    +                +            +
EcXA021    +                +            +
EcXA023                     +            +
EcXA024    +                             +
EcXA025    -
EcXA026    +                             -
EcXA027    +                +            -
EcXA028    +                -            -
EcXA029    -                -            -
EcXA030    +                             +
EcXA031                     -            -
EcXA032    +                +            -
EcXA033    +                +            +
EcXA034    +                +            +
EcXA035    -                -            -
EcXA036    +                -            +
EcXA037    +                             -
EcXA038    +                +            +
EcXA039    +                -            -
EcXA041    +                             +
EcXA042    -                +            +
EcXA043    -                -            -
EcXA044    -                -            -
EcXA045    +                             +
EcXA046    -                -            -
EcXA047    +                +            -
EcXA048    -                -            -
EcXA049                     -            -
EcXA050    -                -            -
EcXA051    +                -            -
EcXA052    +                -            -
EcXA053    +                +            +
EcXA054    -                -            +
EcXA055                     -            -
EcXA056    +                -            +
EcXA057    +                +            -
EcXA058    -                -            -
EcXA059    +                +            +
EcXA060    -                -            -
EcXA061    -                -            -
EcXA062    -                -            -
EcXA063    +                +            -
EcXA064    -                -            -
EcXA065    +                +            -
EcXA066    -                -            -
EcXA067    -                +            -
EcXA068    -                -            -
EcXA069                     +            -
EcXA070
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EcXA190 
EcXA191 
EcXA192 

Thus, the ability of an antisense nucleic acid which inhibits the proliferation of Escherichia
coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas
aeruginosa, Salmonella typhimurium, Acinetobacter baumannii, Bacillus anthracis, Bacteroides
fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum,
Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis,
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Corynebacterium
diphtheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus influenzae, Helicobacter
pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium
avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella
multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi,
Salmonella typhi, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
mutans, Streptococcus pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae or Yersinia pestis to inhibit the growth of other organisms may be
evaluated by transforming the antisense nucleic acid directly into species other than the organism
from which it was obtained. In particular, the ability of the antisense nucleic acid to inhibit the
growth of Acinetobacter baumannii, Anaplasma marginale, Aspergillus fumigatus, Bacillus
anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Candida albicans, Candida
glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida
guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum,
Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Coccidioides immitis,
Corynebacterium diphtheriae, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus
faecalis, Enterococcus faecium, Escherichia coli, Haemophilus influenzae, Helicobacter pylori,
Histoplasma capsulatum, Klebsiella pneumoniae, Legionella pneumophila, Listeria
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis,
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pasteurella
haemolytica, Pasteurella multocida, Pneumocystis carinii, Proteus mirabilis, Proteus vulgaris,
Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas syringae, Salmonella bongori,
Salmonella choleraesuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella
typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei,
Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus
pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, Ureaplasma
urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificus, Yersinia enterocolitica,
Yersinia pestis or any species falling within the genera of any of the above species may be
evaluated. In some embodiments of the present invention, the ability of the antisense nucleic acid
to inhibit the growth of an organism other than E. coli may be evaluated. In such embodiments,
the antisense nucleic acids are inserted into expression vectors functional in the organisms in which
the antisense nucleic acids are evaluated.

It will be appreciated that the above methods for evaluating the ability of an antisense
nucleic acid to inhibit the proliferation of a heterologous organism may be performed using
antisense nucleic acids complementary to any of the proliferation-required nucleic acids from
Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae,
Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter baumannii, Bacillus anthracis,
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia,
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae,
Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile,
Corynebacterium diphtheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus
influenzae, Helicobacter pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella
catarrhalis, Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium
tuberculosis, Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria
meningitidis, Pasteurella multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas
syringae, Salmonella paratyphi, Salmonella typhi, Staphylococcus epidermidis, Staphylococcus
haemolyticus, Streptococcus mutans, Streptococcus pneumoniae, Streptococcus pyogenes,
Treponema pallidum, Ureaplasma urealyticum, Vibrio cholerae or Yersinia pestis (including
antisense nucleic acids complementary to SEQ ID NOs.: 6214-42397, such as the antisense nucleic
acids of SEQ ID NOs.: 1-6213) or portions thereof, antisense nucleic acids complementary to
homologous coding nucleic acids or portions thereof, or homologous antisense nucleic acids.

Those skilled in the art will appreciate that a negative result in a heterologous cell or
microorganism does not mean that the cell or microorganism is missing that gene, nor does it mean
that the gene is nonessential. However, a positive result means that the heterologous cell or
microorganism contains a homologous gene which is required for proliferation of that cell or
microorganism. The homologous gene may be obtained using the methods described herein. For
example, the homologous gene may be isolated by performing a PCR procedure using primers based on
the antisense sequence which reduced the level or activity of the gene product encoded by the
homologous gene or by performing a Southern blot. Those cells that are inhibited by antisense may
be used in cell-based assays as described herein for the identification and characterization of
compounds in order to develop antibiotics effective in these cells or microorganisms. Those skilled
in the art will appreciate that an antisense molecule which works in the microorganism from which
it was obtained will not always work in a heterologous cell or microorganism.
EXAMPLE 5 

Transfer of Exogenous Nucleic Acid Sequences to Other Bacterial Species Using the Escherichia coli, 
Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, 
Salmonella typhimurium, Acinetobacter baumannii, Bacillus anthracis, Bacteroides fragilis, 
Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, 
Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, 
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Corynebacterium 
diptheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus influenzae, Helicobacter 
pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium 
avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma 
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella 
multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, 
Salmonella typhi, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
mutans, Streptococcus pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae or Yersinia pestis Expression Vectors or Expression Vectors Functional 
in Bacterial Species Other Than the Foregoing Bacterial Species 

The antisense nucleic acids that inhibit the growth of Escherichia coli, Staphylococcus 
aureus, Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella 
typhimurium, Acinetobacter baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella 
20 pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia 
mallei, Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium 
acetobutylicum, Clostridium botulinum, Clostridium difficile, Corynebacterium diptheriae, 
Enterobacter cloacae, Enterococcus faecium, Haemophilus influenzae, Helicobacter pylori, 
Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, 
Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma 
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella 
multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, 
Salmonella typhi, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
mutans, Streptococcus pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae or Yersinia pestis, or portions thereof, may also be evaluated for their 
ability to inhibit the growth of cells or microorganisms other than Escherichia coli, Staphylococcus 
aureus, Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella 
typhimurium, Acinetobacter baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella 
pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia 
35 mallei, Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium 
acetobutylicum, Clostridium botulinum, Clostridium difficile, Corynebacterium diptheriae, 
Enterobacter cloacae, Enterococcus faecium, Haemophilus influenzae, Helicobacter pylori, 
Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium avium, 
Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma 
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella 
multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, 
Salmonella typhi, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
5 mutans, Streptococcus pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae or Yersinia pestis. For example, the antisense nucleic acids that 
inhibit the growth of Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella 
pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter baumannii, 
Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia 

10 cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Chlamydia 
pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum, 
Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus faecium, 
Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria monocytogenes, 
Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae, 

15 Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria 
gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus mirabilis, Pseudomonas 
putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi, Staphylococcus 
epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus pneumoniae, 
Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio cholerae or 

20 Yersinia pestis may be evaluated for their ability to inhibit the growth of other organisms. In 
particular, the ability of the antisense nucleic acid to inhibit the growth of Acinetobacter baumannii, 
Anaplasma marginale, Aspergillus fumigatus, Bacillus anthracis, Bacteroides fragilis, Bordetella 
pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia 
mallei, Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis 

25 glabrata), Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, 
Candida kefyr (also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia 
pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum, 
Clostridium difficile, Clostridium perfringens, Coccidioides immitis, Corynebacterium diptheriae, 
Cryptococcus neoformans, Enterobacter cloacae, Enterococcus faecalis, Enterococcus faecium, 

30 Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Histoplasma capsulatum, Klebsiella 
pneumoniae, Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, 
Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, 
Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, 
Nocardia asteroides, Pasteurella haemolytica, Pasteurella multocida, Pneumocystis carinii, 

35 Proteus mirabilis, Proteus vulgaris, Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas 
syringae, Salmonella bongori, Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, 
Salmonella typhi, Salmonella typhimurium, Shigella boydii, Shigella dysenteriae, Shigella flexneri, 
Shigella sonnei, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus haemolyticus, 
Streptococcus pneumoniae, Streptococcus mutans, Streptococcus pyogenes, Treponema pallidum, 
Ureaplasma urealyticum, Vibrio cholerae, Vibrio parahaemolyticus, Vibrio vulnificans, Yersinia 
enterocolitica, Yersinia pestis or any species falling within the genera of any of the above species 
may be evaluated. In some embodiments of the present invention, the ability of the antisense 
5 nucleic acid to inhibit the growth of an organism other than E. coli may be evaluated. 

In such methods, expression vectors in which the expression of an antisense nucleic acid 
that inhibits the growth of Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, 
Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter 
baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, 

Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, 
Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium 
botulinum, Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus 
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 

15 Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus 
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi, 
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus 
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio 
cholerae or Yersinia pestis is under the control of an inducible promoter are introduced into the 
cells or microorganisms in which they are to be evaluated. In some embodiments, the antisense 
nucleic acids may be evaluated in cells or microorganisms which are closely related to Escherichia 
coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas 
aeruginosa, Salmonella typhimurium, Acinetobacter baumannii, Bacillus anthracis, Bacteroides 

fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, 
Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, 
Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Corynebacterium 
diptheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus influenzae, Helicobacter 
pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium 

30 avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma 
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella 
multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, 
Salmonella typhi, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
mutans, Streptococcus pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 

35 urealyticum, Vibrio cholerae or Yersinia pestis. The ability of these antisense nucleic acids to 
inhibit the growth of the related cells or microorganisms in the presence of the inducer is then 
measured. 

EXAMPLE 6 
Identification of Nucleic Acids Homologous to Nucleic Acids Required for Proliferation of 
Staphylococcus aureus in Other Bacterial Species 
Nucleic acids homologous to proliferation-required nucleic acids from Staphylococcus aureus 
were identified as follows. For example, thirty-nine antisense nucleic acids which inhibited the 
5 growth of Staphylococcus aureus were identified using methods such as those described herein and 
were inserted into an expression vector such that their expression was under the control of a xylose- 
inducible Xyl-T5 promoter. A vector with a reporter gene under control of the Xyl-T5 promoter 
was used to show that expression from the Xyl-T5 promoter in Staphylococcus epidermidis was 
comparable to that in Staphylococcus aureus. 
10 The vectors were introduced into Staphylococcus epidermidis by electroporation as follows: 

Staphylococcus epidermidis was grown in liquid culture to mid-log phase and then harvested by 
centrifugation. The cell pellet was resuspended in 1/3 culture volume of ice-cold EP buffer (0.625 
M sucrose, 1 mM MgCl2, pH 4.0), and then harvested again by centrifugation. The cell pellet was 
then resuspended with 1/40 volume EP buffer and allowed to incubate on ice for 1 hour. The cells 
were then frozen for storage at -80°C. For electroporation, 50 µl of thawed electrocompetent cells 
were combined with 0.5 µg plasmid DNA and then subjected to an electrical pulse of 10 kV/cm, 25 
µFarads, 200 ohm using a Bio-Rad Gene Pulser electroporation device. The cells were immediately 
resuspended with 200 µl outgrowth medium and incubated for 2 hours prior to plating on solid 
growth medium with drug selection to maintain the plasmid vector. Colonies resulting from 
20 overnight growth of these platings were selected, cultured in liquid medium with drug selection, 
and then subjected to dilution plating analysis as described for Staphylococcus aureus in Example 1 
above to test growth sensitivity in the presence of the inducer xylose. 
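
The pulse settings quoted above (10 kV/cm, 25 µFarads, 200 ohm) can be turned into instrument values with simple arithmetic. The following Python sketch is illustrative only: the 0.2 cm cuvette gap is an assumption that the text does not state, and the function names are hypothetical. The nominal time constant is the product of the parallel resistance and the capacitance; the value actually delivered also depends on the resistance of the sample.

```python
def pulse_voltage_kv(field_kv_per_cm: float, gap_cm: float) -> float:
    """Voltage to set on the pulser for a given field strength and cuvette gap."""
    return field_kv_per_cm * gap_cm


def time_constant_ms(resistance_ohm: float, capacitance_uf: float) -> float:
    """Nominal RC time constant; ohm x microfarad gives microseconds."""
    return resistance_ohm * capacitance_uf / 1000.0


# Field strength, resistance, and capacitance are taken from the protocol text.
# The 0.2 cm gap is an ASSUMPTION made only for this illustration.
voltage = pulse_voltage_kv(10.0, 0.2)
tau = time_constant_ms(200.0, 25.0)

print(f"set voltage: {voltage:.1f} kV, nominal time constant: {tau:.1f} ms")
```

With a different cuvette gap, only the voltage changes; the nominal time constant depends solely on the resistance and capacitance settings.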

The results are shown in Table III below. The first column indicates the Molecule Number 
of the Staphylococcus aureus antisense nucleic acid which was introduced into Staphylococcus 
25 epidermidis. The second column indicates whether the antisense nucleic acid inhibited the growth 
of Staphylococcus epidermidis, with a "+" indicating that growth was inhibited. Of the 39 
Staphylococcus aureus antisense nucleic acids evaluated, 20 inhibited the growth of Staphylococcus 
epidermidis. 

TABLE III 

Sensitivity of Other Microorganisms to Antisense Nucleic Acids That Inhibit Proliferation of 
Staphylococcus aureus 

Mol. No.     S. epidermidis 
SaXA005      + 
SaXA007      + 
SaXA008      + 
SaXA009      + 
SaXA010      + 
SaXA011 
SaXA012 
SaXA013 
SaXA015      + 
SaXA017      - 
SaXA022      + 
SaXA023 
SaXA024 
SaXA025      + 
SaXA026      + 
SaXA027 
SaXA027b     - 
SaXA027c     - 
SaXA028      - 
SaXA029      + 
SaXA030      + 
SaXA032      + 
SaXA033      + 
SaXA034 
SaXA035      + 
SaXA037      + 
SaXA039 
SaXA042 
SaXA043 
SaXA044 
SaXA045      + 
SaXA051      + 
SaXA053 
SaXA056b 
SaXA059a     + 
SaXA060 
SaXA061      + 
SaXA062      + 
SaXA063 
SaXA065 

Although the results shown above were obtained using a subset of proliferation-required 
nucleic acids from Staphylococcus aureus, it will be appreciated that similar analyses may be 
performed using the nucleic acids of the present invention to determine whether they inhibit the 
proliferation of cells or microorganisms other than Escherichia coli, Staphylococcus aureus, 
5 Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, 
Acinetobacter baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia 
burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter 
jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium 
botulinum, Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus 

10 faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus 
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi, 

15 Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus 
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio 
cholerae or Yersinia pestis. 

Thus, it will be appreciated that the above methods for evaluating the ability of an antisense 
nucleic acid to inhibit the proliferation of a heterologous organism may be performed using 

20 antisense nucleic acids complementary to any of the proliferation-required nucleic acids from 
Escherichia coli, Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae, 
Pseudomonas aeruginosa, Salmonella typhimurium, Acinetobacter baumannii, Bacillus anthracis, 
Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, 
Burkholderia fungorum, Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae, 

Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, 
Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus 
influenzae, Helicobacter pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella 
catarrhalis, Mycobacterium avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium 
tuberculosis, Mycoplasma genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria 

30 meningitidis, Pasteurella multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas 
syringae, Salmonella paratyphi, Salmonella typhi, Staphylococcus epidermidis, Staphylococcus 
haemolyticus, Streptococcus mutans, Streptococcus pneumoniae, Streptococcus pyogenes, 
Treponema pallidum, Ureaplasma urealyticum, Vibrio cholerae or Yersinia pestis (including 
antisense nucleic acids complementary to SEQ ID NOs.: 6214-42397, such as the antisense nucleic 

35 acids of SEQ ID NOs.: 1-6213) or portions thereof, antisense nucleic acids complementary to 
homologous coding nucleic acids or portions thereof, or homologous antisense nucleic acids. 
EXAMPLE 7 

Identification of Homologous Nucleic Acids by Functional Complementation 
Homologous coding nucleic acids, homologous antisense nucleic acids or nucleic acids 
encoding homologous polypeptides may be identified as follows. Gene products whose activities 
5 may be complemented by a proliferation-required gene product from Escherichia coli, 
Staphylococcus aureus, Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, 
Salmonella typhimurium, Acinetobacter baumannii, Bacillus anthracis, Bacteroides fragilis, 
Bordetella pertussis, Borrelia burgdorferi, Burkholderia cepacia, Burkholderia fungorum, 
Burkholderia mallei, Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, 

10 Clostridium acetobutylicum, Clostridium botulinum, Clostridium difficile, Corynebacterium 
diptheriae, Enterobacter cloacae, Enterococcus faecium, Haemophilus influenzae, Helicobacter 
pylori, Legionella pneumophila, Listeria monocytogenes, Moraxella catarrhalis, Mycobacterium 
avium, Mycobacterium bovis, Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma 
genitalium, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella 

15 multocida, Proteus mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, 
Salmonella typhi, Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus 
mutans, Streptococcus pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma 
urealyticum, Vibrio cholerae, Yersinia pestis or homologous polypeptides are identified using 
merodiploids, created by introducing a plasmid or Bacterial Artificial Chromosome into an 

20 organism having a mutation in the essential gene which reduces or eliminates the activity of the 
gene product. In some embodiments, the mutation may be a conditional mutation, such as a 
temperature sensitive mutation, such that the organism proliferates under permissive conditions but 
is unable to proliferate under non-permissive conditions in the absence of complementation by the 
gene on the plasmid or Bacterial Artificial Chromosome. Alternatively, duplications may be 

constructed as described in Roth et al. (1987) Biosynthesis of Aromatic Amino Acids in 
Escherichia coli and Salmonella typhimurium, F. C. Neidhardt, ed., American Society for 
Microbiology, publisher, pp. 2269-2270. Such methods are familiar to those skilled in the art. 
Alternatively, homologous coding nucleic acids, homologous antisense nucleic acids or nucleic 
acids encoding homologous polypeptides may be identified by placing a gene required for 

30 proliferation or a nucleic acid complementary to at least a portion of a gene required for 
proliferation under the control of a regulatable promoter as described above, introducing a plasmid 
or Bacterial Artificial Chromosome into the cell, and identifying cells which are able to proliferate 
under conditions which would prevent or reduce proliferation in the absence of the plasmid or 
Bacterial Artificial Chromosome. 

35 Homologous coding nucleic acids, homologous antisense nucleic acids or nucleic acids 

encoding homologous polypeptides may be identified using databases as follows. 
EXAMPLE 8 

Identification of Homologous Nucleic Acids by Database Analysis 
As a demonstration of the methodology required to find homologies to an essential gene, 
fifty-one prokaryotic organisms were analyzed and compared in detail. First, the most reliable 
5 source of gene sequences for each organism was assessed by conducting a survey of the public and 
private data sources. The fifty-one organisms studied are Escherichia coli, Staphylococcus aureus, 
Enterococcus faecalis, Klebsiella pneumoniae, Pseudomonas aeruginosa, Salmonella typhimurium, 
Acinetobacter baumannii, Bacillus anthracis, Bacteroides fragilis, Bordetella pertussis, Borrelia 
burgdorferi, Burkholderia cepacia, Burkholderia fungorum, Burkholderia mallei, Campylobacter 
10 jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium acetobutylicum, Clostridium 
botulinum, Clostridium difficile, Corynebacterium diptheriae, Enterobacter cloacae, Enterococcus 
faecium, Haemophilus influenzae, Helicobacter pylori, Legionella pneumophila, Listeria 
monocytogenes, Moraxella catarrhalis, Mycobacterium avium, Mycobacterium bovis, 
Mycobacterium leprae, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma 
15 pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Proteus 
mirabilis, Pseudomonas putida, Pseudomonas syringae, Salmonella paratyphi, Salmonella typhi, 
Staphylococcus epidermidis, Staphylococcus haemolyticus, Streptococcus mutans, Streptococcus 
pneumoniae, Streptococcus pyogenes, Treponema pallidum, Ureaplasma urealyticum, Vibrio 
cholerae and Yersinia pestis. Full-length gene protein and nucleotide sequences for these 
20 organisms were assembled from various sources. For Escherichia coli, Haemophilus influenzae 
and Helicobacter pylori, gene sequences were adopted from the public sequencing projects, and 
derived from the GenPept 115 database (available from NCBI). For Pseudomonas aeruginosa, 
gene sequences were adopted from the Pseudomonas genome sequencing project (downloaded from 
http://www.pseudomonas.com). For Klebsiella pneumoniae, Staphylococcus aureus, Streptococcus 
pneumoniae and Salmonella typhi, genomic sequences from PathoSeq v 4.1 (Mar 2000 release) 
were reanalyzed for ORFs using the gene finding software GeneMark v 2.4a, which was purchased 
from GenePro Inc. 451 Bishop St., N.W., Suite B, Atlanta, GA, 30318, USA. Similar analyses 
were conducted for the other organisms using publicly available and proprietary databases. 

Homologous coding nucleic acids and the homologous polypeptides which they encode 
30 may be identified using a "reciprocal" best-hit analysis. To facilitate the identification of 
homologous coding nucleic acids and homologous polypeptides, paralogous genes within each of 
51 organisms were identified and clustered prior to comparison to other organisms. Briefly, the 
polypeptide sequence of each polypeptide encoded by each open reading frame (ORF) in a given 
organism was compared to the polypeptide sequence encoded by every other ORF for that organism 
35 for each of the 51 pathogenic organisms (PathoSeq Sept 2001 release) using BLASTP 2.09 
algorithm without filtering. Simultaneously, the polypeptide sequence encoded by each ORF of an 
organism was compared to the polypeptide sequences encoded by each of the ORFs in the 
remaining 51 organisms. Those polypeptides within a single organism that shared a higher degree 
of sequence identity to one another than to polypeptide sequences obtained from other 
organisms were clustered as "paralog" sequences for "reciprocal" best-hit analysis. 
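
The paralog clustering step described above can be illustrated with a small Python sketch. This is not the procedure actually used; it is a minimal sketch that assumes percent-identity values have already been extracted from BLASTP output into two hypothetical dictionaries, and it links two ORFs of one organism whenever their identity to each other exceeds each ORF's best identity to any ORF from another organism. The function name, input layout, and example values are all assumptions made for illustration.

```python
from collections import defaultdict


def cluster_paralogs(intra, best_inter):
    """Group ORFs of one organism into paralog clusters.

    intra:      {(orf_a, orf_b): percent identity within the organism}
    best_inter: {orf: highest percent identity to any ORF of another organism}
    Both inputs are hypothetical summaries of BLASTP results.
    """
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for orf in best_inter:
        find(orf)
    for (a, b), identity in intra.items():
        # Link ORFs that resemble each other more than any foreign sequence.
        if identity > best_inter.get(a, 0.0) and identity > best_inter.get(b, 0.0):
            parent[find(a)] = find(b)

    clusters = defaultdict(set)
    for orf in list(parent):
        clusters[find(orf)].add(orf)
    return sorted(sorted(members) for members in clusters.values())


# Made-up identities, for illustration only.
intra = {("orf1", "orf2"): 90.0, ("orf2", "orf3"): 30.0}
best_inter = {"orf1": 60.0, "orf2": 55.0, "orf3": 70.0}
print(cluster_paralogs(intra, best_inter))
```

In this toy input, orf1 and orf2 form one cluster because they are 90% identical to each other but at most 60% identical to any foreign sequence, while orf3 remains on its own.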

For each reference organism, the 50 homologous coding nucleic acids (and the 50 
homologous polypeptides which they encode) were determined by identifying the ORFs in each of 
5 the 50 comparison organisms which encode a polypeptide sharing the highest degree of amino acid 
sequence identity to the polypeptide encoded by the ORF from the reference organism. The 
accuracy of the identification of the predicted homologous coding nucleic acids (and the 
homologous polypeptides which they encode) was confirmed by a "reciprocal" BLAST analysis in 
which the polypeptide sequence of the predicted homologous polypeptide was compared against the 

polypeptides encoded by each of the ORFs in the reference organism using BLASTP 2.09 
algorithm without filtering. Only those polypeptides that share the highest degree of amino acid 
sequence identity in each portion of the two-way comparison are retained for further analysis. 
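
The two-way comparison described above is commonly called a reciprocal best-hit filter. The Python sketch below shows one way it could be expressed, assuming the BLASTP results have already been reduced to (query, subject, percent identity) tuples. The function names and the example identifiers are hypothetical, and ranking by percent identity alone is a simplification of whatever scoring was actually applied.

```python
def best_hits(hits):
    """Map each query to the subject with the highest percent identity.

    hits: iterable of (query, subject, identity) tuples.
    """
    best = {}
    for query, subject, identity in hits:
        if query not in best or identity > best[query][1]:
            best[query] = (subject, identity)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(forward, reverse):
    """Keep pairs that are each other's best hit in both search directions.

    forward: reference ORFs searched against the comparison organism.
    reverse: comparison ORFs searched against the reference organism.
    """
    fwd = best_hits(forward)
    rev = best_hits(reverse)
    return sorted((q, s) for q, s in fwd.items() if rev.get(s) == q)


# Made-up hits, for illustration only.
forward = [("refA", "cmpX", 85.0), ("refA", "cmpY", 40.0), ("refB", "cmpY", 50.0)]
reverse = [("cmpX", "refA", 85.0), ("cmpY", "refA", 40.0), ("cmpY", "refB", 35.0)]
print(reciprocal_best_hits(forward, reverse))
```

Here refA and cmpX are retained because each is the other's top hit, whereas refB is dropped: its best hit, cmpY, points back to refA rather than to refB.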

The best homolog for each of the fifty-one organisms, defined as the most significantly 
scoring match which also fulfilled the above criteria, was reported in Table IV. 

15 Table IV lists the best ORF identified as described above (column labeled Homolog 

LocusID) that matches the query sequence (column labeled Query LocusID), % identity between 
the query sequence and the homolog, and the amount of each sequence that aligns together well 
(columns labeled Query Coverage and Homolog Coverage) for the gene identified in each of the 
fifty-one organisms evaluated as described above. As described in connection with Table IC, the 

Locus IDs (i.e., both Query Locus ID and Homolog Locus ID) provided in Table IV each comprise a 
nine digit alpha-numeric identifier that can be used to determine the organism from which the query 
and homolog sequences were obtained. Specifically, the first letter of the Locus ID corresponds to 
the first letter of the genus name of the organism described herein from which the Locus was 
identified and the second and third letters of the Locus ID correspond to the first two letters of the 

25 species name of this organism. For example, the identifier EFA205257 describes a gene locus 
identified from Enterococcus faecalis. In those instances where the three letter identifier is the 
same for different organisms, the exact identity of the organism which corresponds to the Locus ID 
can be determined by referring to the organism designation in the sequence listing for the coding 
nucleic acid or polypeptide SEQ ID NO. that corresponds to the particular Locus ID. 
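
The Locus ID convention described above can be sketched in Python as follows. The sketch assumes that a Locus ID consists of a three-letter prefix followed by six digits, which is consistent with the example EFA205257 but is not stated in that form in the text; the function names are hypothetical. Because the text notes that different organisms can share a prefix, the organism check is only a heuristic and the sequence listing remains the authoritative source.

```python
import re

# ASSUMPTION: three letters followed by six digits (e.g. EFA205257).
LOCUS_RE = re.compile(r"^([A-Z]{3})(\d{6})$")


def parse_locus_id(locus_id):
    """Split a Locus ID into its organism prefix and numeric part."""
    cleaned = locus_id.replace(" ", "").upper()
    match = LOCUS_RE.match(cleaned)
    if match is None:
        raise ValueError(f"unrecognized Locus ID: {locus_id!r}")
    return match.group(1), match.group(2)


def matches_organism(locus_id, genus, species):
    """Check the prefix against genus initial plus first two species letters."""
    prefix, _ = parse_locus_id(locus_id)
    return prefix == (genus[0] + species[:2]).upper()


print(parse_locus_id("EFA205257"))
print(matches_organism("EFA205257", "Enterococcus", "faecalis"))
```

A prefix match does not uniquely identify the organism when two species share the same three letters, so a positive result should be confirmed against the sequence listing.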
TABLE IV 

Query Locus ID  Homolog Locus ID  Identity  Query Coverage  Homolog Coverage 
ECO100001       ECO100001         100%      100%            100% 
ECO100001       STY104800         85%       100%            100% 
ECO100001       STM104529         85%       100%            100% 
ECO100002       ABA101891         29%       40.9%           80.3% 
ECO100002       BAN108126         29%       55.5%           98.5% 
ECO100002       BAN105795         30%       55.5%           95.8% 
ECO100002       BFR11028          36%       98.9%           99.4% 
ECO100002       BPT101425         30%       41.5%           81.7% 
ECO100002       BCE111633         28%       42.2%           73.4% 
ECO100002       BMA100957         30%       41.3%           82.5% 
ECO100002       CJU100543         30%       55.5%           97.8% 
ECO100002       CDF100035         25%       5.4%            74.4% 
ECO100002       CDF100673         25%       54.9%           96.7% 
ECO100002       CDP101634         28%       46.1%           95.7% 
ECO100002       EBC103170         92%       70.7%           100% 
ECO100002       EFA201386         27%       56.2%           99.8% 
ECO100002       ECO100002         100%      100%            100% 
ECO100002       HIN100088         62%       99.3%           99.8% 
ECO100002       HPY101212         32%       41.8%           83.5% 
ECO100002       KPN300246         90%       4.1%            76.7% 
ECO100002       KPN302085         92%       100%            100% 
ECO100002       LMO102676         27%       57.3%           100% 
ECO100002       MCA100888         30%       40.9%           80.3% 
ECO100002       MAV103162         26%       55.5%           94.8% 
ECO100002       MBV103584         28%       41.0%           79.6% 
ECO100002       MLP101383         25%       55.5%           94.8% 
ECO100002       MTU203655         28%       41.0%           79.6% 
ECO100002       NGO100216         30%       8.9%            4.2% 
ECO100002       NME201560         30%       8.9%            42% 
ECO100002       PMU100113         64%       99.3%           99.8% 
ECO100002       PRT104264         76%       100%            100% 
ECO100002       PAE200903         30%       4.5%            5.6% 
ECO100002       PPU106924         31%       40.9%           80.3% 
ECO100002       PSY103112         30%       31.7%           82.0% 
ECO100002       SPA100167         93%       39.5%           100% 
ECO100002       STY100365         94%       100%            100% 
ECO100002       STM100038         94%       100%            100% 
ECO100002       SAU801327         26%       56.2%           97.6% 
ECO100002       SEP201977         26%       56.2%           98.0% 
ECO100002       SHA100755         26%       56.2%           99.8% 
ECO100002       SMU100555         25%       56.6%           100% 
ECO100002       SPN400374         25%       56.1%           98.7% 
ECO100002       VCH102329         62%       100%            99.3% 
ECO100002       YPS000769         81%       100%            100% 
ECO100004       BFR11030          47%       85.3%           93.3% 
ECO100004       BPT101337         32%       99.8%           99.4% 
ECO100004       BCE114764         28%       97.4%           96.9% 
ECO100004       BFU101149         29%       94.9%           94.6% 
ECO100004       BMA105857         30%       90.4%           95.0% 
ECO100004       CJU100751         29%       90.2%           87.2% 
ECO100004       CAC103481         28%       99.5%           95.8% 
ECO100004       CBO101914         30%       79.9%           86.5% 
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73% 


96.4% 


95.0% 


ECO 100009 


BFU100752 


72% 
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88.1% 


ECO 100009 


BMA109380 


72% 
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93.2% 


ECO 100009 


CJU100675 


50% 


95.9% 
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ECO100009 


CAC 100345 


41% 


79.5% 


90.3% 


ECO 100009 


CBO103886 


42% 


79.5% 
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ECO 100009 
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35% 
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88.6% 
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ECO100009 
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83.8% 
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ECO100009 


100% 


100% 


100% 
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BFR105539 


39% 


100% 


100% 


ECO100023 


BPT101291 


51% 


100% 


100% 


ECO100023 


BBU 100232 


38% 


92.0% 


74.8% 


ECO 100023 


BCE1 12030 


47% 


98.9% 


95.6% 


ECO100023 


BFU100359 


51% 


98.9% 


93.5% 


ECO100023 


BMA 106459 


48% 


98.9% 


93.5% 


ECO100023 


CJU101517 


34% 


100% 


100% 


ECO100023 


CPN201099 


31% 


94.3% 


82.8% 


ECO 100023 


CTR200896 


33% 


88.5% 


75.5% 


EC01 00023 


CAC102146 


45% 


98.9% 


97.7% 


ECO100023 


CBO100245 


39% 


98.9% 


97 7% 


ECO100023 


CDF10U97 


41% 


96.6% 


81.6% 


ECO100023 


CDP101196 


47% 


100% 


100% 


ECO100023 


EBC101439 


96% 


70.1% 


100% 


ECO100023 


EFA200336 


48% 


92.0% 


7Ji* fQ 


ECO100023 


EFM201993 


34% 


97.7% 


100% 

1 W fQ 


ECO100023 


ECO 100023 


100% 


100% 


100% 

1 W fQ 


ECO100023 


HIN 100944 


72% 


100% 


07 R% 
y / .o /o 


ECO100023 


HPY100074 


33% 


100% 


97.8% 


ECO100023 


KPN205579 


98% 


100% 


100% 


ECO100023 


LMO100322 


46% 


97.7% 


100% 


ECO100023 


MCA1 00794 


56% 


98.9% 


97.7% 


ECO100023 


MAV103835 


43% 


98.9% 


100% 


ECO100023 


MBV104427 


41% 


98.9% 


97.7% 


ECO100023


MLP100381 


41% 


98.9% 


100% 


ECO100023 


MTU202376 


43% 


98.9% 


100% 
Query LocusID


Homolog LocusID 


Identity 


Query Coverage 


Homolog Coverage 


ECO100023 


MGE100374 


30% 


9X7% 


88.6% 


ECO100023 


MPN100301 


31% 


97.7% 


89.7% 


ECO100023


NGO101642 


57% 


100% 


100% 


ECO100023 


NME201868 


57% 


100% 


100% 


ECO100023 


PMU101659 


78% 


100% 


97.8% 


ECO100023 


PRT102018 


83% 


98.9% 


100% 


ECO100023 


PAE204561 


59% 


100% 


95.6% 


ECO100023 


PPU108230 


59% 


98.9% 


93.5% 


ECO100023 


PSY102689 


58% 


100% 


94.6% 


ECO100023 


SPA101944 


97% 


100% 


100% 


ECO100023 


STY101095 


97% 


100% 


100% 


ECO100023 


SAU801586 


40% 


90.8% 


94.0% 


ECO100023 


SEP203256 


37% 


90.8% 


90.7% 


ECO100023 


SHA102295 


39% 


90.8% 


94.0% 


ECO100023 


SMU102606


41% 


97.7% 


92.9% 


ECO100023 


SPN400740 


43%


97.7% 


94.0% 


ECO100023


SPY200946 


45% 


95.4% 


98.7% 


ECO100023 


UUR100312 


40% 


88.5% 


83.3% 


ECO100023 


VCH100666 


70% 


98.9% 


100% 


ECO100023 


YPS003605 


87%


98.9% 


98.9% 


ECO100025 


ABA101842 


41% 


97.8% 


97.3% 


ECO100025


BAN106284 


28% 


98.1% 


99.4% 


ECO100025 


BAN101499 


35% 


99.4% 


97.2%


ECO100025


BFR100412 


36% 


98.4% 


98.1% 


ECO100025 


BPT100434


47% 


93.3% 


93.1% 


ECO100025 


BCE109423


47% 


98.1% 


89.0% 


ECO100025 


BFU101232 


45% 


98.1% 


92.7% 


ECO100025 


BMA109546 


46% 


98.1% 


89.0% 


ECO100025 


CJU100550 


26% 


91.4% 


91.9% 


ECO100025


CPN200431 


31% 


92.0% 


90.9% 


ECO100025 


CTR200362 


32% 


91.1% 


93.9% 


ECO100025 


CAC102490 


31% 


89.8% 


92.4% 


ECO100025 


CBO102062


31% 


86.3% 


97.8% 


ECO100025 


CDF100874 


34% 


95.5% 


94.8% 


ECO100025 


CDP100453 


30% 


94.9% 


90.1% 


ECO100025 


EBC100241 


84% 


56.9% 


98.9% 


ECO100025 


EFA200587 


31% 


94.9% 


94.3% 


ECO100025 


EFM201219 


33% 


97.1% 


97.4% 


ECO100025


ECO100025 


100% 


100% 


100% 


ECO100025 


HIN100942 


53% 


98.1% 


98.1% 


ECO100025 


HPY101070 


30% 


91.4% 


93.6% 


ECO100025 


KPN300577 


86%


91.1% 


100% 


ECO100025 


LPN101116 


45% 


97.8% 


92.7% 


ECO100025 


LMO100570 


34% 


92.3% 


92.0% 


ECO100025 


MCA100302 


41% 


93.3% 


93.8% 


ECO100025 


MAV100301 


35% 


92.0% 


93.5% 


ECO100025 


MBV101247 


35% 


91.7% 


98.4% 


ECO100025 


MLP100513 


36% 


91.7% 


91.2%


ECO100025 


MTU202748 


35% 


92.0% 


91.5% 


ECO100025 


MGE100147 


28% 


89.5% 


92.9% 


ECO100025 


MPN100673 


27% 


92.0% 


96.7% 


ECO100025 


NGO101768 


47% 


97.1% 


95.3% 


ECO100025 


NME200577 


47% 


97.1% 


95.6% 


ECO100025 


PMU101661 


58% 


98.1% 


98.4% 
WO 02/077183 PCT/US02/09107




ECO100025


PRT105259 


67% 


97.8% 


97.8% 


ECO100025 


PAE204559 


55% 


98.1% 


98.1% 


ECO100025 


PPU105777 


52% 


97.1%


95.9% 


ECO100025


PSY102686 


52% 


97.1% 


99.7% 


ECO100025


SPA101946 


88% 


98.7% 


100% 


ECO100025 


STY101100 


89% 


98.7% 


98.7% 


ECO100025 


STM100815 


89% 


98.7% 


98.7% 


ECO100025 


SAU801272 


34% 


97.1% 


95.4% 


ECO100025


SEP201565 


35% 


98.7% 


96.9% 


ECO100025 


SHA100170 


35% 


97.1% 


95.4% 


ECO100025 


SMU100956


36% 


91.4% 


92.5% 


ECO100025 


SPN401017 


33% 


92.0% 


93.4% 


ECO100025 


SPY200960 


32% 


96.8% 


96.1% 


ECO100025 


TPA100878 


30% 


69.6% 


80% 


ECO100025 


UUR100357 


26% 


87.2% 


86.9% 


ECO100025 


VCH100668 


58% 


97.1% 


94.1% 


ECO100025


YPS000809 


75% 


99.7% 


99.7% 


ECO100026 


ABA103852 


54% 


99.8% 


100% 


ECO100026 


BAN106248 


37% 


92.3% 


96.1% 


ECO100026 


BAN106209 


43% 


86.5% 


99.1% 


ECO100026


BFR10500 


25% 


81.1% 


79.2% 


ECO100026


BPT100432


51% 


99.5% 


99.6% 


ECO100026 


BBU100832 


26% 


81.8% 


74.0% 


ECO100026 


BCE111670


51% 


95.3% 


97.0% 


ECO100026 


BFU101230 


51%


99.3%


98.4% 


ECO100026 


BMA102496 


43% 


98.5% 


94.9% 


ECO100026 


BMA109115 


50% 


99.6% 


98.7% 


ECO100026


CJU100989 


40% 


96.3%


97.4% 


ECO100026 


CPN200653 


26% 


81.7% 


73.5% 


ECO100026 


CTR200283 


27% 


81.0% 


73.4% 


ECO100026 


CAC102765 


27% 


90.5% 


81.7% 


ECO100026


CBO100873 


30% 


55.1% 


96.8% 


ECO100026


CDF104554 


29% 


80.0% 


72.4% 


ECO100026


CDP101247 


28% 


89.1% 


80.5% 


ECO100026


EBC102151 


92% 


98.2% 


100%


ECO100026 


EFA202160 


44% 


98.1% 


97.8% 


ECO100026 


EFM201425 


44% 


98.5% 


98.1% 


ECO100026


ECO100026 


100% 


100% 


100% 


ECO100026


HPY101401 


39% 


98.5% 


99.2%


ECO100026 


KPN300290 


95% 


4.7% 


100% 


ECO100026 


KPN306610 


92% 


98.3% 


100% 


ECO100026


LPN101832


55% 


99.9% 


99.9% 


ECO100026 


LMO101679 


43% 


98.7% 


99.2% 


ECO100026 


MCA103671 


52% 


0.9% 


16.5% 


ECO100026 


MAV103388 


27% 


83.7% 


74.6% 


ECO100026 


MBV102555 


28% 


82.3%


75.3% 


ECO100026


MLP100743


24% 


89.0% 


80.8% 


ECO100026 


MTU201515 


28% 


82.3% 


75.3% 


ECO100026 


MGE100354 


32% 


98.3% 


99.4% 


ECO100026 


MPN100322 


33% 


89.3% 


95.7% 


ECO100026


NGO101799 


53% 


99.7% 


100% 


ECO100026 


NME200578 


53% 


100% 


100% 


ECO100026 


PMU101662 


69% 


99.8% 


99.7% 


ECO100026 


PRT101657 


80% 


100% 


100% 





ECO100026


PAE204558 


57% 


100% 


100% 


ECO100026


PPU109898 


56% 


100%


100% 


ECO100026


PSY102684


54% 


98.3% 


100% 


ECO100026


SPA100667 


85% 


82.8% 


99.5% 


ECO100026


STY101103 


86%


100% 


100% 


ECO100026


STM100818 


87% 


100% 


100% 


ECO100026


SAU801193 


42% 


98.3% 


99.0% 


ECO100026 


SEP200928 


42% 


98.3% 


98.7% 


ECO100026 


SHA101034 


41% 


98.2% 


99.5% 


ECO100026


SMU100109 


40% 


98.5% 


98.1% 


ECO100026 


SPN401501 


41% 


98.5% 


98.1% 


ECO100026 


SPY201164 


42% 


95.8% 


95.2% 


ECO100026 


TPA100448


26% 


89.6% 


81.1% 


ECO100026


UUR100414 


36% 


97.0% 


97.8% 


ECO100026 


VCH100669 


67% 


100% 


100% 


ECO100026


YPS000810 


83% 


100% 


100% 


ECO100032 


BFR102496 


39% 


97.6% 


100% 


ECO100032 


BCE114725


66% 


97.1% 


94.6% 


ECO100032 


BFU101987 


64% 


97.1% 


90.0% 


ECO100032 


BMA105964 


64% 


97.1% 


94.6% 


ECO100032 


CJU101410 


40% 


96.9% 


99.5% 


ECO100032


CAC100686 


41% 


96.3% 


99.1% 


ECO100032 


CDF102200 


42% 


96.3% 


98.8%


ECO100032


CDF101392 


47% 


97.1% 


95.3% 


ECO100032


CDP100339 


45% 


98.2%


94.4% 


ECO100032


EBC100940


91% 


51.8% 


100% 


ECO100032


EFA201091 


45% 


97.9% 


98.6% 


ECO100032 


EFM201379 


47% 


96.9% 


97.8% 


ECO100032 


ECO100032 


100% 


100% 


100% 


ECO100032 


HPY101220 


38% 


97.1% 


99.2% 


ECO100032 


KPN306459 


80% 


100% 


100% 


ECO100032


LPN100982 


58% 


97.6% 


99.2% 


ECO100032


LMO100114 


46% 


96.6% 


96.1% 


ECO100032 


MCA101274


62% 


96.6% 


95.9% 


ECO100032


MAV103355 


43% 


98.2% 


99.2% 


ECO100032 


MBV100477 


44% 


57.6% 


97.3% 


ECO100032


MLP100326 


42% 


98.7% 


100% 


ECO100032 


MTU201366 


45% 


97.6% 


98.7% 


ECO100032 


NGO101672 


67% 


96.1% 


96.6% 


ECO100032


NME200564 


67% 


97.1% 


97.6% 


ECO100032


PMU101502 


65% 


97.1% 


95.3% 


ECO100032


PRT101652 


81% 


99.2% 


97.9% 


ECO100032 


PAE204754 


71% 


99.0% 


100% 


ECO100032 


PPU108522


70% 


89.8% 


100% 


ECO100032 


PSY107482 


68% 


64.4% 


100% 


ECO100032


SPA101379 


86% 


100% 


100% 


ECO100032 


STY101157 


93% 


100% 


100% 


ECO100032


STM100872 


94% 


100% 


100% 


ECO100032 


SAU801202 


45% 


97.6% 


96.4% 


ECO100032 


SEP102450 


43% 


99.2% 


98.1% 


ECO100032


SHA102129


44% 


99.2% 


98.1% 


ECO100032


SMU100787


43% 


97.6% 


97.2% 


ECO100032 


SPN401153 


43% 


98.2% 


98.6% 


ECO100032 


SPY200619 


43% 


98.7% 


98.9% 


ECO100032 


VCH102355 


82% 


99.0% 


99.7% 


ECO100032 


YPS000843 


86% 


99.7% 


97.4% 


ECO100033 


ABA105447 


76% 


86.5% 


51.8% 


ECO100033 


BAN106466


36% 


92.5% 


93.4% 


ECO100033 


BAN106013 


50% 


97.2% 


73.8% 


ECO100033 


BFR11696 


39% 


96.3% 


96.2%


ECO100033 


BFR10760 


40% 


39.0% 


91.4% 


ECO100033 


BPT101199 


69% 


97.1% 


73.7% 


ECO100033 


BCE101906 


68% 


99.9% 


99.9% 


ECO100033


BFU114502


68% 


99.9% 


99.9% 


ECO100033


BMA100194 


68% 


99.9% 


99.9% 


ECO100033 


CJU100250 


55% 


100% 


99.9% 


ECO100033 


CAC100380 


46% 


100% 


100% 


ECO100033 


CDF103346 


45% 


68.9% 


81.0% 


ECO100033 


CDF103290 


48% 


97.4% 


97.7% 


ECO100033 


CDF102440 


47% 


97.4% 


97.7% 


ECO100033 


CDP100338 


52% 


98.4% 


98.2%


ECO100033 


EBC103143 


96% 


4.1% 


77.8% 


ECO100033 


EFA201093 


49% 


98.6% 


99.4% 


ECO100033 


EFM200554 


48% 


98.6% 


99.4% 


ECO100033 


ECO100033 


100% 


100% 


100% 


ECO100033


HPY100903 


53% 


99.4% 


99.7% 


ECO100033 


KPN306727 


95% 


92.1% 


92.8% 


ECO100033 


LPN103124 


72% 


90.9% 


74.6% 


ECO100033 


LMO101147 


50% 


98.8% 


98.5% 


ECO100033 


MCA101278 


70% 


99.7% 


99.9% 


ECO100033 


MAV103356


51% 


99.9% 


99.4% 


ECO100033 


MBV100481 


52% 


99.9% 


99.6% 


ECO100033 


MLP100327


51% 


99.9% 


98.3% 


ECO100033 


MTU301470 


52% 


99.9% 


99.6% 


ECO100033


NGO101630 


69% 


100% 


99.7% 


ECO100033 


NME200558


69% 


100% 


99.7% 


ECO100033 


PMU101505 


69% 


99.9% 


99.9% 


ECO100033


PRT101651 


90% 


100% 


99.8% 


ECO100033 


PAE204752 


76% 


100% 


99.9% 


ECO100033


PPU105037 


74% 


100% 


99.6% 


ECO100033 


PSY104746 


75% 


100% 


99.9% 


ECO100033 


SPA101378 


93% 


87.4% 


63.3% 


ECO100033


STY101163 


98% 


100% 


99.8% 


ECO100033 


STM100875 


98% 


100% 


99.8% 


ECO100033 


SAU801203 


49% 


99.5% 


100% 


ECO100033 


SEP201467 


52% 


86.8% 


35.4% 


ECO100033 


SHA102130 


49% 


99.5% 


100% 


ECO100033 


SMU100789 


50% 


98.4% 


99.2% 


ECO100033 


SPN401152 


49% 


98.4% 


99.2% 


ECO100033 


SPY200620 


49% 


98.4% 


99.2% 


ECO100033


VCH102354 


85% 


100% 


99.6% 


ECO100033 


YPS000846 


92% 


100% 


99.6% 


ECO100040 


ECO100040 


100% 


100% 


100% 


ECO100040 


MAV102307 


26% 


22.0% 


44.2%


ECO100040 


PRT105337


87% 


100% 


100% 


ECO100040


SPA102043 


92% 


54.8% 


100% 


ECO100040 


STY101185 


96% 


100% 


99.8% 


ECO100068 


BPT100719 


25% 


79.8% 


77.4% 





ECO100068


BCE110446


23% 


97.2% 


89.3% 


ECO100068 


BFU104253 


25% 


59.3% 


49.6% 


ECO100068 


CJU100163 


23% 


81.0% 


82.0% 


ECO100068 


EBC103044 


90% 


96.0% 


97.5% 


ECO100068


ECO100068 


100% 


100% 


100% 


ECO100068 


HIN100998 


50% 


99.1% 


99.7% 


ECO100068 


KPN303934 


83% 


100% 


100% 


ECO100068 


MCA100722 


24% 


85.6% 


85.2% 


ECO100068 


NGO101933 


30% 


97.9% 


97.3% 


ECO100068 


NME200270 


29% 


97.9% 


97.3% 


ECO100068 


PMU100376 


49% 


95.4% 


94.3% 


ECO100068 


PRT100202


66% 


93.9% 


94.5% 


ECO100068 


PAE204684 


26% 


81.7% 


79.7% 


ECO100068 


PSY105429 


22% 


88.4% 


87.8% 


ECO100068


SPA101253 


89% 


100% 


100% 


ECO100068


STY101259 


91% 


100% 


100% 


ECO100068 


TPA100143 


31% 


91.4% 


90.7% 


ECO100068 


VCH102502 


65% 


97.6% 


97.0% 


ECO100068 


YPS000940 


72% 


98.8% 


97.9% 


ECO100069 


BAN110716 


37% 


7.8% 


66.2% 


ECO100069 


BAN110657


27% 


12.0% 


60.6% 


ECO100069 


BAN101932 


26% 


57.5% 


56.3% 


ECO100069 


EBC103045 


79% 


100% 


100% 


ECO100069 


ECO100069 


100% 


100% 


100% 


ECO100069


KPN303932 


78% 


100% 


100% 


ECO100069 


LMO101504


24% 


94.4% 


91.4% 


ECO100069 


PRT103304 


47% 


54.3% 


100% 


ECO100069 


SPA101254 


81% 


99.5%


100% 


ECO100069


STY101261 


86% 


100% 


100% 


ECO100069 


STM100598 


38% 


99.5% 


99.7% 


ECO100069 


VCH103302 


33% 


99.5% 


99.1% 


ECO100069 


YPS000941 


62% 


99.8% 


99.8% 


ECO100081 


BFR12308 


32% 


53.9% 


68.6% 


ECO100081


BPT103032 


44% 


87.5% 


90.8% 


ECO100081


BCE103942


47% 


82.9% 


85.9% 


ECO100081 


BFU100820 


48% 


82.2% 


85.2% 


ECO100081 


BMA105750 


47% 


82.2% 


85.2% 


ECO100081 


CAC103463 


29% 


100% 


100% 


ECO100081 


CBO101902 


31% 


91.4% 


85.9% 


ECO100081 


CDP101285 


34% 


93.4% 


95.8% 


ECO100081 


EBC101941 


95% 


78.9% 


100% 


ECO100081 


EFA202186 


34% 


88.2% 


92.3% 


ECO100081 


EFM202534 


39% 


82.2% 


83.9% 


ECO100081 


ECO100081 


100% 


100% 


100% 


ECO100081 


HIN101103 


59% 


100% 


100% 


ECO100081 


KPN301855 


93% 


78.9% 


100% 


ECO100081 


LPN102025 


43% 


100% 


100% 


ECO100081 


LMO101121 


36% 


96.7% 


96.5% 


ECO100081 


MAV103918 


32% 


94.1% 


99.3% 


ECO100081 


MBV101485 


31% 


94.1% 


99.3% 


ECO100081 


MLP100560


32% 


94.1% 


99.3% 


ECO100081


MTU202132 


31% 


94.1% 


99.3% 


ECO100081 


MGE100226


27% 


98.0% 


90.3% 


ECO100081 


MPN100522 


28% 


98.7% 


99.3% 


ECO100081 


NGO100621 


40% 


91.4% 


93.4% 


ECO100081 


NME201917 


40% 


91.4% 


93.4% 


ECO100081 


PMU100133 


55% 


100% 


100% 


ECO100081 


PRT102634 


77% 


100% 


100% 


ECO100081 


PAE204419 


51% 


100% 


100% 


ECO100081 


PPU105962


50% 


100% 


100% 


ECO100081 


PSY103859 


52% 


100% 


100% 


ECO100081 


SPA102750 


93% 


100% 


100% 


ECO100081 


STY103147 


93% 


100% 


100% 


ECO100081 


STM102859 


94% 


100% 


100% 


ECO100081


SAU801178 


33% 


98.0% 


97.9% 


ECO100081 


SEP200891 


32% 


98.0% 


97.9% 


ECO100081


SHA101307 


33% 


98.0% 


97.9% 


ECO100081 


TPA100379 


34% 


99.3% 


96.6% 


ECO100081 


UUR100388 


25% 


96.7% 


96.6% 


ECO100081 


YPS000986 


84% 


100% 


100% 


ECO100093


ABA100624 


24% 


88.0% 


82.7% 


ECO100093 


BAN113055


20% 


68.1% 


70.9% 


ECO100093 


BAN105429 


21% 


68.1% 


74.2% 


ECO100093 


BPT102987 


28% 


89.1% 


94.5% 


ECO100093 


BCE102722 


30% 


75% 


82% 


ECO100093 


BFU100406 


28% 


81.9% 


88% 


ECO100093


BMA104087 


29% 


75% 


82% 


ECO100093 


CDP101255 


29% 


22.8% 


3U% 


ECO100093 


EBC101799 


85% 


100% 


100%


ECO100093 


ECO100093 


100% 


100% 


100% 


ECO100093 


HIN101115 


39% 


83.7% 


86.6% 


ECO100093 


KPN301845 


88% 


100% 


100% 


ECO100093 


LPN103526 


29% 


85.1% 


95.0% 


ECO100093


MCA100437


24% 


66.3% 


74.6% 


ECO100093 


MAV103267 


23% 


43.8% 


55.3% 


ECO100093


MLP100571 


24% 


34.4% 


27.9% 


ECO100093


NGO100580


34% 


85.9% 


94.6% 


ECO100093


NME201903 


34% 


85.9% 


94.6% 


ECO100093 


PMU100145 


37% 


87.0% 


93.4% 


ECO100093 


PRT102631 


61% 


93.8% 


97.7% 


ECO100093 


PAE204407 


31% 


74.6% 


70.0% 


ECO100093 


PPU105995 


32% 


90.9% 


87.2% 


ECO100093 


PSY103839 


33% 


83.7% 


81.3%


ECO100093 


SPA101845 


93% 


100% 


100% 


ECO100093 


STY103179 


93% 


100% 


100% 


ECO100093 


STM102891 


93% 


100% 


100% 


ECO100093 


SEP200899 


22% 


62.7% 


39.8% 


ECO100093 


SHA100844


23% 


62.0% 


47.2% 


ECO100093


VCH102364 


39% 


88.0% 


90.4% 


ECO100093 


YPS001020 


66% 


97.5% 


99.6% 


ECO100094


ABA100623 


30% 


96.7% 


95.7% 


ECO100094 


BAN111291


31% 


94.3% 


89.7% 


ECO100094


BAN101900 


32% 


93.6% 


88.2% 


ECO100094


BFR102146 


24% 


79.8% 


68.3% 


ECO100094


BPT102984 


45% 


97.4% 


98.0% 


ECO100094 


BBU100299 


35% 


98.6% 


99.3% 


ECO100094


BCE109663


46% 


98.6% 


98.8% 


ECO100094 


BFU100407


46% 


98.6% 


98.8% 


ECO100094


BMA104126 


47% 


98.6% 


98.8% 


ECO100094


CJU100645 


27% 


89.8% 


80.3% 


ECO100094 


CAC100288 


26% 


89.0% 


86.6%


ECO100094 


CBO102974 


28% 


89.0% 


85.0% 


ECO100094


EBC101798 


98% 


100% 


100% 


ECO100094 


EFA202170 


33% 


90.5% 


83.6% 


ECO100094 


EFM201888


32% 


89.8% 


82.8% 


ECO100094


ECO100094 


100% 


100% 


100% 


ECO100094 


HIN101116 


52% 


100% 


100% 


ECO100094 


HPY100962


25% 


92.9% 


83.9% 


ECO100094 


KPN300531 


96% 


58.3% 


100% 


ECO100094


LPN103001 


44% 


100% 


100% 


ECO100094


LMO101001 


32% 


95% 


89.7% 


ECO100094 


MCA100438 


25% 


82.4% 


78.9% 


ECO100094 


NGO100578 


41% 


97.6% 


98.1% 


ECO100094 


NME201902 


42% 


97.1% 


97.6% 


ECO100094 


PMU100146 


50% 


100% 


100% 


ECO100094


PRT102629 


89% 


100% 


100% 


ECO100094 


PAE204406 


51% 


93.6% 


92.6% 


ECO100094 


PPU111774


52% 


90% 


90.0% 


ECO100094 


PSY103837 


52% 


91.7% 


92.2% 


ECO100094 


SPA101844 


97% 


100% 


100% 


ECO100094 


STY103209 


99% 


100% 


100% 


ECO100094


STM102892 


99% 


100% 


100% 


ECO100094 


SAU801185 


24% 


88.8% 


76.6% 


ECO100094


SEP200915 


25% 


88.8% 


77.6% 


ECO100094


SHA100845


25% 


83.6% 


71.9% 


ECO100094 


SMU100084 


31% 


89.0% 


79.9% 


ECO100094 


SPN401510 


33% 


83.3% 


74.8% 


ECO100094 


SPY201171 


31% 


89.0% 


79.7% 


ECO100094 


TPA100385


33% 


93.3% 


95.9% 


ECO100094


VCH102363 


68% 


100% 


100% 


ECO100094 


YPS001022 


95% 


100% 


100% 


ECO100095


ABA100622 


49% 


95.8% 


93.9% 


ECO100095 


BAN103024 


47% 


96.9% 


96.4% 


ECO100095 


BAN112439


48% 


96.9% 


96.4% 


ECO100095 


BFR12421 


43% 


84.1% 


74.1% 


ECO100095 


BPT102981 


53% 


97.1% 


96.4% 


ECO100095 


BBU100298 


47% 


97.1% 


92.3% 


ECO100095 


BCE103310 


58% 


75.7% 


100% 


ECO100095 


BFU100408


52% 


98.2% 


97.7% 


ECO100095 


BMA104500 


52% 


99.7% 


99.5% 


ECO100095 


CJU100646 


40% 


100% 


98.9% 


ECO100095 


CAC100838 


50% 


96.9% 


95.7% 


ECO100095 


CBO103263


49% 


96.9% 


96.5% 


ECO100095 


CDF102447


48% 


98.4% 


96.2% 


ECO100095 


CDP101254 


51% 


84.6% 


77.9% 


ECO100095 


EBC101797 


97% 


41.5% 


97.5% 


ECO100095 


EFA202168 


51% 


90.1% 


84.1% 


ECO100095 


EFM200220 


54% 


82.0% 


75.8% 


ECO100095 


ECO100095


100% 


100% 


100% 


ECO100095


HIN101117


63% 


79.9% 


77.9% 


ECO100095 


HPY100963 


40% 


79.4% 


79.5% 


ECO100095


KPN300530 


98% 


41.8% 


100% 


ECO100095


LPN100473


71% 


9.4%


100% 


ECO100095 


LMO101235 


46% 


96.9% 


96.2% 


ECO100095 


MCA101528 


55% 


85.9% 


91.9% 


ECO100095 


MAV103265 


47% 


96.6% 


97.7% 


ECO100095 


MBV100533 


53% 


79.4% 


79.9% 


ECO100095 


MLP100572 


45% 


96.6% 


97.9% 


ECO100095 


MTU202116 


53% 


79.4% 


79.9% 


ECO100095 


MGE100229


34% 


54.6% 


55.6% 


ECO100095 


MPN100519 


32% 


54.0% 


53.4% 


ECO100095


NGO100577 


48% 


96.9% 


95.2% 


ECO100095 


NME201901 


48% 


96.9% 


95.2% 


ECO100095 


PMU100147 


62% 


87.5% 


82.7% 


ECO100095 


PRT102627 


94% 


48.8% 


100% 


ECO100095


PAE204405 


57% 


100% 


100% 


ECO100095


PPU105993 


58% 


100% 


100% 


ECO100095 


PSY103835 


57% 


100% 


100% 


ECO100095


SPA100031 


91% 


67.1% 


98.1% 


ECO100095 


STY103211


98% 


100% 


100% 


ECO100095 


STM102893 


98% 


100% 


100% 


ECO100095


SAU801186 


52% 


81.2% 


79.0% 


ECO100095


SEP200917 


45% 


97.4% 


95.2%


ECO100095 


SHA100846 


45% 


97.4% 


95.2% 


ECO100095


SMU100085


51% 


89.6% 


78.1% 


ECO100095 


SPN401509 


56% 


80.7% 


73.5% 


ECO100095 


SPY201170 


54% 


82.0% 


71.3% 


ECO100095 


TPA100386 


49% 


79.4% 


72.7% 


ECO100095 


VCH102362 


75% 


100% 


100% 


ECO100095 


YPS001023 


94% 


100% 


100% 


ECO100096 


ABA100621 


49% 


71.5% 


100% 


ECO100096


BFR100706 


38% 


88.5% 


63.3% 


ECO100096 


BPT102979 


52% 


97.7% 


98.0% 


ECO100096


BCE106802


54% 


97.4% 


97.0% 


ECO100096 


BFU106497


53% 


97.4% 


97.0% 


ECO100096 


BMA100406 


55% 


97.4% 


97.0% 


ECO100096 


CJU100121 


42% 


94.1% 


95.9% 


ECO100096 


CPN200093 


36% 


89.2%


95.8% 


ECO100096 


CTR200809 


38% 


90.8% 


95.5% 


ECO100096 


EBC100070 


97% 


37.4% 


100% 


ECO100096


EFA204185 


36% 


16.4% 


7.8% 


ECO100096 


ECO100096 


100% 


100% 


100% 


ECO100096


HIN101118


77% 


99.7% 


99.7% 


ECO100096 


HPY101035 


42% 


95.4% 


96.9% 


ECO100096 


KPN300609 


94% 


76.7% 


100% 


ECO100096 


MCA100416 


54% 


97.0% 


97.7% 


ECO100096 


NGO101976 


48% 


96.4% 


96.4% 


ECO100096


NME200247 


49% 


96.4% 


96.4% 


ECO100096 


PMU100148


77% 


99.7% 


99.7% 


ECO100096 


PRT100138 


86% 


90.8% 


100% 


ECO100096


PAE204404 


57% 


99.7% 


100% 


ECO100096 


PPU111772


56% 


99.7% 


100% 


ECO100096


PSY103834 


56% 


99.7% 


100% 


ECO100096 


SPA100700 


87% 


100% 


100% 


ECO100096 


STY103212 


98% 


100% 


100% 


ECO100096 


STM102894 


98% 


100% 


100% 


ECO100096


VCH102361 


73% 


100% 


100% 


ECO100096


YPS001024


92% 


100% 


OB 7 0/ 


ECO100102 


BPT102719 


31% 


100% 


99.2%


ECO100102 


BCE111292


30% 


91.5% 


91.2%


ECO100102 


BFU101371 


31% 


98.4% 


QC so/ 


ECO100102 


BMA106052


30% 


98.4% 


AO 00/ 

y 0.0/0 


ECO100102 


EBC103445 


80% 


100%


100%


ECO100102 


ECO100102 


100% 


100%


100%


ECO100102 


KPN301732 


76% 


97.6%


100%


ECO100102 


LPN101018 


24% 


93.9% 


92.7%


ECO100102


PRT104002


53% 


95.4%


AT TO/ 


ECO100102 


PPU100250 


45% 


16.2% 


AT ^ro/ 


ECO100102 


SPA101560 


89% 


100%


100%


ECO100102 


STY103218 


89% 


100%


100%


ECO100102 


VCH102393 


40% 


97.0% 


AT £0/ 

y /.o% 


ECO100102


YPS000901 


67% 


100% 


AO Q0/ 


ECO100113 


BAN100380 


31% 


S4.0% 


OQ TO/ 


ECO100113 


BMA109038 


39% 


90.0% 


03.0% 


ECO100113 


CDF100440


28% 


89.0% 


0£ £0/ 

06.0% 


ECO100113 


EBC103453 


92%


100%


100%


ECO100113


ECO100113


100%


100%


100%


ECO100113 


KPN307947 


90%


24.0%


AT AO/ 


ECO100113


KPN301741 


93% 


100% 


Qfi 10/ 

yo.i% 


ECO100113 


LMO100648


28% 


89.4% 


AC TO/ 


ECO100113 


MAV105850


25% 


90.6% 


92% 


ECO100113 


MAV106443


25% 


91.7% 


94.7% 


ECO100113 


MAV106442


28% 


86.6% 


85% 


ECO100113 


NGO100599


34% 


96.9% 


94.6% 


ECO100113 


NME201814 


34% 


96.9% 


94.2% 


ECO100113 


PRT106136


79% 


98.8% 


96.2% 


ECO100113 


PAE204765 


47% 


97.6%


97.7%


ECO100113 


PPU100113 


46% 


97.6%


98.8%


ECO100113


SPA100332 


95% 


100% 


100% 


ECO100113


STY103289 


96% 


100%


100%


ECO100113 


STM102972 


96% 


100%


100%


ECO100113 


VCH102380 


65% 


98.8%


98.0%


ECO100113 


YPS000871


86% 


99.6% 


99.6% 


ECO100115 


ABA105536 


46% 


33.7% 


68.0% 


ECO100115 


BAN101075 


31% 


0.2% 


45.1% 


ECO100115 


BAN101226 


36% 


17.1% 


43.7% 


ECO100115


BPT100902


56% 


0.2% 


43.0% 


ECO100115 


BCE106716 


56% 


21.1% 


4.3% 


ECO100115 


BFU100111


53% 


25.9% 


13.8% 


ECO100115


BMA108930 


53% 


20.8% 


55.4% 


ECO100115 


CJU100863


21% 


4.3% 


34.7% 


ECO100115 


CDP100400 


34% 


98.7% 


98.2% 


ECO100115 


EBC102752 


90% 


100% 


100% 


ECO100115 


EFA202405 


34% 


37.1% 


59.6% 


ECO100115 


EFM201990 


32% 


37.1% 


60.1% 


ECO100115 


ECO100115


100% 


100% 


100% 


ECO100115 


HIN101204


68% 


34.6% 


58.6% 


ECO100115 


KPN301446 


91% 


100% 


100% 


ECO100115 


LPN100774 


45% 


10.3% 


53.9% 


ECO100115 


LMO102367


35% 


37.1% 


60.1% 


ECO100115 


MCA101150


39% 


19.0% 


44.2% 


ECO100115 


MAV103601 


31% 


1.6%


33.2% 


ECO100115 


MTU405582 


32% 


50.2% 


84.6% 


ECO100115 


MTU400129 


32% 


64.4% 


99.8% 


ECO100115


MGE100278 


25% 


16.5% 


26.3% 


ECO100115 


MPN100447 


27% 


3.2% 


46.0%


ECO100115 


NGO100078 


53% 


15.6% 


38.0% 


ECO100115 


NME201414 


56% 


33.2% 


19.6% 


ECO100115 


PMU100894 


72% 


100% 


100% 


ECO100115 


PRT102280 


78% 


100% 


100% 


ECO100115 


PAE205011 


54% 


83.0% 


99.3% 


ECO100115 


PPU105100 


54% 


18.7% 


59.2% 


ECO100115 


PSY103646 


52% 


20.2% 


43.8% 


ECO100115 


SPA100283 


80% 


60.5% 


100% 


ECO100115


STY103294


93% 


100% 


100% 


ECO100115 


STM102974 


93% 


100% 


100% 


ECO100115 


SAU101783 


36% 


39.8% 


77.6% 


ECO100115 


SPY200777 


30% 


3.8% 


35.4% 


ECO100115 


VCH102378 


74% 


100% 


99.1% 


ECO100115


YPS000863 


79% 


14.0% 


37.7% 


ECO100116 


ABA101766 


38% 


95.6% 


96.4% 


ECO100116 


BAN101609 


33% 


96.4% 


97.0% 


ECO100116


BAN103383 


44% 


96.4% 


97.0% 


ECO100116 


BFR10034 


33% 


94.1% 


98.2% 


ECO100116 


BFR11786 


32% 


94.1% 


98.7% 


ECO100116


BPT100905


63% 


99.6% 


79.2% 


ECO100116 


BCE110691


66% 


97.9% 


79.7% 


ECO100116


BFU100082 


65% 


99.4% 


92.9% 


ECO100116 


BMA107694 


65% 


99.6% 


80.0% 


ECO100116 


CPN201018 


35% 


93.0% 


94.1% 


ECO100116 


CTR200835 


38% 


93.0% 


94.0% 


ECO100116 


CBO103777 


35% 


93.5% 


95.2% 


ECO100116 


CBO103904


37% 


94.7% 


97.0% 


ECO100116 


CDF102926 


38% 


94.7% 


95.7% 


ECO100116 


CDP100695 


36% 


96.6% 


98.7% 


ECO100116 


EBC102751 


97% 


100% 


100% 


ECO100116 


EFA202404 


43% 


96.4% 


97.4% 


ECO100116


EFM201020 


42% 


96.4% 


97.4% 


ECO100116 


ECO100116 


100% 


100% 


100% 


ECO100116 


HIN101203 


81% 


100% 


99.2% 


ECO100116 


KPN301447 


96% 


100% 


99.8% 


ECO100116


LPN100879 


62% 


100% 


100% 


ECO100116 


LMO101821 


43% 


95.4% 


96.4% 


ECO100116 


MCA101643 


37% 


98.1% 


98.8% 


ECO100116 


MAV101768 


37% 


95.6% 


97.0% 


ECO100116


MBV103409 


37% 


97.0% 


98.7% 


ECO100116 


MLP101416 


34% 


97.0% 


98.7% 


ECO100116 


MTU200460 


37% 


97.0% 


98.7% 


ECO100116 


MGE100277 


32% 


93.7% 


96.3% 


ECO100116 


MPN100448 


33% 


93.7% 


96.3% 


ECO100116 


NGO100074


63% 


99.4% 


81.1% 


ECO100116 


NME201415 


63% 


99.4% 


81.1% 


ECO100116 


PMU100893 


88% 


100% 


100% 


ECO100116 


PRT102281 


90% 


100% 


99.8% 


ECO100116 


PAE201586 


42% 


98.5% 


99.4% 


ECO100116


PPU103115 


40% 


98.5% 


99.4% 


ECO100116 


PSY104859


41% 


98.5% 


99.4% 


ECO100116 


SPA100336 


96% 


83.5% 


100% 


ECO100116 


STY103295 


98% 


100% 


99.8% 


ECO100116 


STM103004 


98% 


100% 


100% 


ECO100116 


SAU801096 


44% 


95.6% 


96.4% 


ECO100116


SEP200806 


45% 


95.6% 


96.4% 


ECO100116 


SHA100679 


44% 


95.6% 


96.8% 


ECO100116 


SMU101480


36% 


93.2% 


74.7% 


ECO100116 


SPN401048 


38% 


93.2% 


76.5% 


ECO100116


SPY200778 


35% 


97.5% 


77.7% 


ECO100116 


VCH102377 


89% 


100% 


99.8% 


ECO100116


YPS000861 


93% 


100% 


99.8% 


ECO100117 


BPT103682 


29% 


59.3% 


100% 


ECO100117 


BCE104746 


29% 


31.9% 


62.8% 


ECO100117 


BFU101391 


43% 


46.8% 


46.8% 


ECO100117 


BFU100907 


41% 


49.3% 


57.2% 


ECO100117 


EBC102750 


48% 


89.5% 


100% 


ECO100117 


EFA200061 


22% 


25.8% 


20.0% 


ECO100117 


EFA201265 


22%


25.8% 


20.0% 


ECO100117 


ECO100117 


100% 


100% 


100%


ECO100117 


KPN305987 


52% 


91.2% 


96.4% 


ECO100117 


MCA102218 


28% 


10.9% 


46.5% 


ECO100117 


PPU101283 


36% 


77.3% 


91.2% 


ECO100117 


SPA101490 


56% 


88.8% 


100% 


ECO100117 


STY103296 


58% 


87.8% 


98.0% 


ECO100117 


SAU800401 


27% 


13.1% 


11.6% 


ECO100117 


SPN401793 


29% 


14.1% 


22.7% 


ECO100118 


ABA100442


74% 


99.3% 


98.7% 


ECO100118 


BPT102269 


72% 


99.1% 


99.5% 


ECO100118 


BCE109527 


73% 


99.3% 


99.8% 


ECO100118 


BFU107948 


74% 


99.3% 


98.1% 


ECO100118 


CJU100774 


60% 


98.4% 


99.2% 


ECO100118 


EBC106332 


96% 


82.5% 


100% 


ECO100118 


ECO100118 


100% 


100% 


100% 


ECO100118 


HPY100766


61% 


98.4% 


99.2% 


ECO100118 


KPN300389


95% 


18.2% 


90.2% 


ECO100118 


KPN300864 


96% 


93.5% 


100% 


ECO100118 


MCA101955 


73% 


95.5% 


99.3% 


ECO100118 


NGO101334 


74% 


99.3% 


99.8%


ECO100118 


NME201620 


74% 


99.3% 


99.8% 


ECO100118 


PMU100204 


81% 


99.9% 


99.5%


ECO100118 


PRT101310 


88% 


100% 


100% 


ECO100118 


PAE201786 


79% 


99.0% 


98.8% 


ECO100118 


PPU111285


79% 


96.3% 


96.0%


ECO100118 


PSY101607 


78% 


99.0% 


96.3% 


ECO100118 


SPA100508 


94% 


60% 


100% 


ECO100118 


STY103298 


96% 


100% 


100% 


ECO100118 


STM103008 


96% 


100% 


100% 


ECO100118 


TPA100176 


31% 


13.3% 


37.0% 


ECO100118 


VCH100593 


82% 


99.7% 


99.7% 


ECO100118 


YPS000857 


90% 


100% 


100%


ECO100135 


ECO100135


100% 


100% 


100% 
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Query LocusID 


Homolog LocusID 


Identity 


Query Coverage


Homolog Coverage 


ECO100135


SPA106733


29% 


40.3%


97.3% 


ECO100135 


STY103389


32% 


98.5% 


97.2% 


ECO100136 


ECO100136 


100% 


100% 


100% 


ECO100136 


SPA109709


36% 


58.6%


98.2%


ECO100136 


STY103420


37% 


86.4% 


84.8% 


ECO100139 


ABA100394 


33% 


96.0% 


97.9% 


ECO100139 


ABA100739 


36% 


92.5% 


97.6%


ECO100139 


BFU103132 


30% 


68.4% 


77.5%


ECO100139 


BFU103304


38% 


76.4% 


92.9%


ECO100139 


BFU103222 


33% 


94.7% 


87.3% 


ECO100139 


ECO100139 


100% 


100% 


100% 


ECO100139 


KPN302863 


35% 


94.3% 


96.5% 


ECO100139 


PRT105280 


35% 


93.9% 


93.8% 


ECO100139 


PAE204081 


35% 


93.2% 


94.2% 


ECO100139 


SPA104033


45% 


96.4% 


100% 


ECO100139 


STY103424 


60% 


98.3% 


98.4% 


ECO100139 


SHA101318 


19% 


16.4% 


31.5% 


ECO100139 


SPY100688 


22% 


23.1% 


53.9% 


ECO100139 


TPA100432 


23% 


19.5% 


46.1% 


ECO100139 


YPS000524 


36% 


93.1% 


91.7% 


ECO100140 


ABA100736 


35% 


95.1% 


96.7% 


ECO100140 


ABA100392 


40% 


88.2% 


88.2% 


ECO100140 


BPT101284 


41% 


87.0% 


75.9% 


ECO100140 


BFU103245 


39% 


94.7% 


97.9% 


ECO100140 


BFU103302 


47% 


93.9% 


93.5% 


ECO100140 


ECO100140 


100% 


100% 


100% 


ECO100140 


KPN302865 


39% 


91.5% 


93.6% 


ECO100140 


PRT105676 


42% 


95.9% 


90.7% 


ECO100140 


PAE204082 


39% 


87.0% 


85.1% 


ECO100140 


PPU101843 


34% 


43.9% 


81.8% 


ECO100140 


PSY105016 


36% 


87.0% 


76.6% 


ECO100140 


SPA104034 


48% 


97.2% 


95.2% 


ECO100140 


STY103425 


57% 


100% 


100% 


ECO100140 


YPS000522 


41% 


96.7% 


92.3% 


ECO100142 


ABA105477 


47% 


88.1% 


85.6% 


ECO100142 


BAN112513 


43% 


84.3% 


77.8% 


ECO100142 


BFR100366 


39% 


84.9% 


88.9% 


ECO100142 


BPT101655 


40% 


88.7% 


85.3% 


ECO100142


BCE102263 


49% 


49.7% 


62.9% 


ECO100142 


BFU102640 


37% 


99.4% 


85.7% 


ECO100142 


BMA109305 


39% 


95.0% 


87.4% 


ECO100142 


CJU100058 


29% 


79.9% 


80.3% 


ECO100142


CAC103001 


38% 


84.9% 


48.7% 


ECO100142 


CBO100107 


41% 


87.4% 


90.7% 


ECO100142 


CDF102980 


37% 


86.2%


50.6% 


ECO100142 


CDP100654 


36% 


93.7% 


90.6% 


ECO100142 


EBC102063 


85% 


88.1% 


100% 


ECO100142 


ECO100142 


100% 


100% 


100% 


ECO100142 


HIN100063


56% 


99.4% 


98.1% 


ECO100142 


HPY101019 


31% 


86.2% 


83.4% 


ECO100142


KPN308497 


77% 


100% 


100% 


ECO100142


LPN102612 


37% 


88.1% 


98.6% 


ECO100142 


LMO102140 


42% 


84.9% 


84.3% 


ECO100142 


MCA100865


38% 


99.4% 


98.8% 
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Query LocusID 


Homolog LocusID


Identity 


Query Coverage


Homolog Coverage 


ECO100142


MAV104005 


38% 


88.7% 


78.0% 


ECO100142 


MBV102400 


33% 


84.3%


73.9% 


ECO100142 


MLP100158 


35% 


80.5% 


70.2% 


ECO100142 


MTU203554 


33% 


84.3% 


73.9% 


ECO100142 


NGO101986 


42% 


84.3% 


81.1% 


ECO100142 


NME200881 


43% 


84.3% 


81.1% 


ECO100142 


PMU100865 


57% 


92.5% 


88.5% 


ECO100142 


PRT102518 


62% 


97.5% 


95.2% 


ECO100142 


PAE204724 


51% 


96.9% 


94.4% 


ECO100142


PPU108440 


48% 


99.4% 


99.4% 


ECO100142 


PSY105158 


53% 


96.9% 


93.3% 


ECO100142 


SPA104025 


80% 


98.1% 


100% 


ECO100142 


STY103427


89% 


90.6% 


90.6% 


ECO100142 


SAU800516 


40% 


89.9% 


89.9% 


ECO100142 


SEP201849 


43% 


87.4% 


86.8% 


ECO100142 


SHA100112 


41% 


87.4% 


86.8% 


ECO100142 


SMU100995 


42% 


86.2% 


77.7% 


ECO100142 


SPN400269 


36% 


97.5% 


55.2%


ECO100142 


SPY200833 


44% 


87.4% 


83.1% 


ECO100142 


VCH100582 


58% 


100% 


95.2% 


ECO100142 


YPS000787 


65% 


93.1% 


93.1% 


ECO100144 


ABA104779 


45% 


93.8% 


91.9% 


ECO100144 


BPT100055 


48% 


88.6% 


96.8% 


ECO100144 


BCE107846


45% 


88.6% 


95.6% 


ECO100144 


BFU114519 


47% 


83.1% 


86.6% 


ECO100144 


BMA109098 


46% 


81.8% 


88.9% 


ECO100144 


EBC102068 


78% 


92.9% 


90.2%


ECO100144 


ECO100144 


100% 


100% 


100% 


ECO100144 


KPN306084 


76% 


92.9% 


97.6% 


ECO100144 


MCA102257 


34% 


88.3% 


93.8% 


ECO100144 


NGO101470 


46% 


91.9% 


98.6% 


ECO100144 


NME201975 


45% 


91.9% 


98.6% 


ECO100144 


PAE204720 


48% 


92.2% 


96.9% 


ECO100144 


PPU108420 


50% 


93.8% 


96.3% 


ECO100144 


PSY105149 


44% 


81.8% 


96.9% 


ECO100144 


SPA104029 


79% 


80.5% 


98.4% 


ECO100144 


STY103460


81% 


96.4% 


99.7% 


ECO100144 


VCH100584 


56% 


96.4% 


98.0% 


ECO100144 


YPS000781 


71% 


91.2% 


87.5% 


ECO100145 


ABA104294 


63% 


95.4% 


81.5% 


ECO100145 


BAN111357


29% 


69.5% 


51.8% 


ECO100145 


BAN107715 


31% 


69.5% 


46.5% 


ECO100145 


BPT100533 


37% 


94.7% 


92.2% 


ECO100145 


BBU100167 


29% 


74.2% 


92% 


ECO100145 


BCE102976 


45% 


78.8% 


85.5% 


ECO100145 


BFU114180


43% 


78.8% 


84.9% 


ECO100145 


BMA106451 


46% 


78.8% 


85.5% 


ECO100145


CJU100114 


26% 


76.8% 


97.5% 


ECO100145 


CTR200677 


23% 


74.2% 


85.8% 


ECO100145 


CAC101598 


27% 


71.5% 


58.5% 


ECO100145 


CBO101773 


30% 


63.6% 


54.1% 


ECO100145 


CDF100661


32% 


32.5% 


30.1% 


ECO100145 


EBC102067 


98% 


100% 


100% 


ECO100145 


ECO100145 


100% 


100% 


100% 



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Query LocusID


Homolog LocusID 


Identity 


Query Coverage


Homolog Coverage 


ECO100145


HIN100061 


75% 


94.0% 


97.9% 


ECO100145


KPN304195 


98% 


100%


100% 


ECO100145


LPN103126 


72% 


88.1% 


84.2% 


ECO100145


MCA101582 


62% 


86.1% 


93.5% 


ECO100145


NGO101937 


45% 


82.1% 


89.1%


ECO100145


NME200203 


45% 


82.1% 


89.1% 


ECO100145 


PMU100863 


80% 


94.0% 


97.9% 


ECO100145 


PRT101562 


94% 


100% 


100% 


ECO100145 


PAE204719 


76% 


86.8% 


88.5% 


ECO100145


PPU108418 


68% 


85.4% 


83.8% 


ECO100145


PSY100089 


76% 


74.2% 


96.6% 


ECO100145 


SPA104031


97% 


30.5% 


100% 


ECO100145 


STY103461 


98% 


100% 


100% 


ECO100145 


SAU801431 


32% 


31.1% 


80.3% 


ECO100145 


SPY200759 


50% 


19.9% 


34.7% 


ECO100145 


TPA100095


29% 


56.3% 


65% 


ECO100145 


VCH100585 


85% 


95.4% 


97.3% 


ECO100145 


YPS000779 


94% 


100% 


100% 


ECO100148 


CDP101536 


34% 


89.7% 


99.0% 


ECO100148 


EBC102064


78% 


69.9% 


100% 


ECO100148


ECO100148 


100% 


100% 


100% 


ECO100148


KPN304202 


81% 


98.2% 


100% 


ECO100148


PRT101564 


55% 


97.8% 


99.6% 


ECO100148


PAE203958 


47% 


97.6% 


99.4% 


ECO100148


PPU107649


45% 


96.7% 


99.5% 


ECO100148 


PSY108786


45% 


97.6% 


96.6% 


ECO100148 


SPA101069


75% 


100% 


100% 


ECO100148 


STY103465


84% 


98.7% 


100% 


ECO100148


STM103140 


84% 


100% 


100% 


ECO100148


VCH100590


48% 


97.9% 


99.5% 


ECO100148


YPS000776 


65% 


98.1% 


98.6% 


ECO100150


ABA103956 


29% 


98.5% 


99.0% 


ECO100150


BFR104161 


21% 


22.4% 


18.2% 


ECO100150 


BFR100416 


25% 


19.0% 


17.3% 


ECO100150 


BFR10467 


21% 


20.3% 


20.2% 


ECO100150 


BFR11841 


27% 


16.1% 


17.5% 


ECO100150 


BFR101776 


25% 


17.0% 


16.1% 


ECO100150


BFR103964


23% 


20.5% 


20.3% 


ECO100150


BFR104855 


31% 


11.8% 


13.2% 


ECO100150 


BFR100601 


22% 


44.0% 


33.4% 


ECO100150


BFR10065 


27% 


16.3% 


36.7% 


ECO100150


BFR103068


20% 


21.4% 


22.6% 


ECO100150


BFR104786


21% 


26.6% 


24.0% 


ECO100150


BFR10126


22% 


86.9% 


79.2% 


ECO100150 


BPT101854 


38% 


93.2% 


83.9% 


ECO100150


BCE111433


33% 


82.6% 


98.4% 


ECO100150


BFU107418 


24% 


98.4% 


96.3% 


ECO100150 


BMA109111 


34% 


95.9% 


94.8% 


ECO100150 


EBC101242 


89% 


96.5% 


100% 


ECO100150 


ECO100150 


100% 


100% 


100% 


ECO100150 


HIN101435 


26% 


39.2% 


87.2% 


ECO100150 


KPN304208 


60% 


100% 


100% 


ECO100150 


MCA102097


23% 


23.2% 


19.8% 


ECO100150 


MBV102043 


29% 


9.1% 


35.6% 
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Query LocusID


Homolog LocusID


Identity


Query Coverage


Homolog Coverage


ECO100150 


MTU202942


47/0 


9.1% 


37.0%


ECO100150 


PRT104914


J J /o 


99.7% 


98.9%


ECO100150 


PAE202464


jo/o 


91.7% 


80.7%


ECO100150 


PrU 10533 / 


JO /o 


94.4% 


97.9%


ECO100150 


PSY101285


J / so 


97.3% 


97.2% 


ECO100150 


SPA100272


91%


47.4% 


100%


ECO100150 


STY104353


JJ /o 


98.1%


99.3%


ECO100150 


STM103144


/ J /o 


100%


100%


ECO100150


TPA100587


2Vyo 


14.1%


47.5% 


ECO100150 


VCH100198


170/ 


01 1% 


90.7% 


ECO100150 


YPS000978


ZO/o 


96.5%


98.1% 


ECO100151 


BFR105362


170/. 


7R ^% 
/o»w /o 


82.5% 


ECO100151


DX7T T 1 AOH 1 


OZ/0 


OR 1% 

70t * /0 


88.1%


ECO100151 


/-iTT I! At 0*70 


JO /o 


07*0 /0 


94.0% 


ECO100151 






87 S% 
0 / /o 


91.0%


ECO100151




100% 


100% 


100%


ECO100151


TJpv 1 AAR74 


J l /o 


90.9% 


94.9%


ECO100151 




Q7% 


100%


100%


ECO100151 


ODT1 (\0AQQ 
r K 1 1 UZ477 


16% 


89.8%


94.0% 


ECO100151 


rrulOOlol 


47% 


R6 0% 

OU.v/0 


87.6%


ECO100151 


SPA100271


/3/0 


fn f.% 

0 / . J /o 


100% 


ECO100151 


P'PV 1 A1 >4 iCA 

oil 1 034oy 


yi/o 


100%


100% 


ECO100151


STM103175


yz/o 


100%


100% 


ECO100151 


OX *T T1 AAl 1 C 

SMUlOOllo 


ioTO 


OA 7% 

yu.z/o 


88.4% 


ECO100151 


X Ti~*TJ 1 AA1 OA 
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45% 


100% 


90.9% 


ECO100171 


SMU101143 


47% 


97.5% 


96.3% 


ECO100171 


SPN400845 


49% 


97.9% 


96.0% 


ECO100171 


SPY200331 


48% 


97.5% 


97.5% 


ECO100171 


TPA100098 


31% 


25.3% 


24.3% 


ECO100171 


UUR100519 


35% 


96.3% 


98.7% 


ECO100171 


VCH102224 


85% 


100% 


99.2% 


ECO100171 


YPS001129 


90% 


100% 


100% 


ECO100179 


ABA101312 


39% 


97.7% 


99.1% 


ECO100179 


BFR105934 


35% 


97.1% 


96.0% 


ECO100179 


BPT101128 


42% 


91.5% 


89.3% 


ECO100179 


BCE111319 


41% 


99.1% 


98.3% 


ECO100179 


BFU114742


39% 


96.5% 


92.7% 


ECO100179


BMA109885 


39% 


97.4% 


96.1% 


ECO100179


CJU100537 


34% 


95.3% 


99.7% 


ECO100179


CPN200450 


34% 


98.5% 


94.7% 


ECO100179 


CTR200507 


33% 


99.1% 


96.9% 


ECO100179


CBO103766 


28% 


23.8% 


84.3% 


ECO100179 


EBC103091


89% 


100% 


100% 


ECO100179 


ECO100179 


100% 


100% 


100% 


ECO100179 


HIN100894 


65% 


98.5% 


99.1% 


ECO100179 


HPY100193 


31% 


90.9% 


91.4% 


ECO100179 


KPN301403 


92% 


90.6% 


100% 


ECO100179 


LPN102597 


35% 


99.1% 


99.7% 


ECO100179 


MCA100324 


44% 


87.7% 


90.8% 


ECO100179 


NGO100826


38% 


97.7% 


96.3% 


ECO100179 


NME200082 


38% 


97.7% 


96.3% 


ECO100179 


PMU101994 


65% 


98.2% 


98.5% 


ECO100179 


PRT101247 


80% 


100% 


99.7% 


ECO100179 


PAE203644 


51% 


98.2% 


94.9% 


ECO100179 


PPU101150 


49% 


98.5% 


95.7% 


ECO100179 


PSY104345 


50% 


98.2% 


95.4% 


ECO100179 


SPA100512 


85% 


83.6% 


58.2% 


ECO100179 


STY103879


95% 


100% 


100% 


ECO100179 


STM103592 


95% 


100% 


100% 


ECO100179 


VCH102216 


64% 


100% 


97.2%


ECO100179 


YPS001147 


82% 


99.7% 


100% 


ECO100180 


ABA101324 


44% 


94.0% 


100% 


ECO100180 


BAN112058


43% 


56.3% 


92.2%


ECO100180 


BAN101529 


48% 


92.1% 


95.1% 


ECO100180 


BPT101129 


51% 


92.1% 


94.7% 


ECO100180 


BCE105358 


58% 


96.0% 


94.2% 


ECO100180 


BFU107782


52% 


96.0% 


93.5% 


ECO100180 


BFU105730 


55% 


96.7% 


94.8% 


ECO100180 


BMA107929 


57% 


96.0% 


94.2% 


ECO100180 


CJU100245 


47% 


90.1% 


94.5% 


ECO100180 


CPN200094 


44% 


92.1% 


91.5% 


ECO100180 


CTR200808 


43% 


90.1% 


89.5% 


ECO100180


CAC102008 


45% 


89.4% 


94.3% 


ECO100180 


CBO103374 


45% 


90.7% 


93.8% 
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Query LocusID 


Homolog LocusID 


Identity 


Query Coverage


Homolog Coverage 


ECO100180 


CDF104046 


51% 


25.8% 


97.5% 


ECO100180


CDF101479 


42% 


57.6% 


95.5% 


ECO100180


CDF100018 


52% 


78.8% 


96.7% 


ECO100180


EBC103090 


93% 


39.1% 


78.7% 


ECO100180


EBC100020


91% 


57.0% 


84.3% 


ECO100180 


EBC100023 


91% 


57.0% 


84.3% 


ECO100180 


EFA203430 


50% 


90.1% 


95.0% 


ECO100180 


EFM202041 


50% 


90.1% 


96.4% 


ECO100180 


ECO100180 


100% 


100% 


100% 


ECO100180 


HIN101039 


68% 


92.7% 


93.9% 


ECO100180 


HPY101356 


46% 


88.7% 


86.8% 


ECO100180 


KPN301913 


97% 


100% 


100% 


ECO100180 


LPN101746 


52% 


84.8% 


89.0% 


ECO100180 


LMO100873 


51% 


93.4% 


96.5% 


ECO100180 


MCA100038 


44% 


96.0% 


83.4% 


ECO100180 


MBV105275 


29% 


45.0% 


31.9% 


ECO100180 


NGO100839 


55% 


92.7% 


94.0% 


ECO100180 


NME200083 


55% 


92.7% 


94.0% 


ECO100180 


PMU101995 


70% 


96.7% 


95.4% 


ECO100180


PRT101246 


83% 


97.4% 


85.5% 


ECO100180 


PAE203643 


54% 


94.0% 


98.6% 


ECO100180


PPU101164 


57% 


92.1% 


96.6% 


ECO100180 


PSY104344 


56% 


94.0% 


98.6% 


ECO100180 


SPA100773


99% 


100% 


100% 


ECO100180 


STY103910 


99% 


100% 


100% 


ECO100180 


STM103593 


98% 


100% 


100% 


ECO100180 


SAU802098 


46% 


89.4% 


91.1% 


ECO100180 


SEP204192 


44% 


89.4% 


91.7% 


ECO100180 


SHA100083 


38% 


35.8% 


93.1% 


ECO100180 


SMU100534 


46% 


92.1% 


97.1% 


ECO100180 


SPN400384 


46% 


92.1% 


97.1% 


ECO100180 


SPY201345 


46% 


92.1% 


97.8% 


ECO100180 


VCH102215 


64% 


97.4% 


96.1% 


ECO100180 


YPS001148 


89% 


100% 


83.4% 


ECO100183 


ABA101887 


65% 


93.4% 


98.9% 


ECO100183 


BAN111817 


53% 


82.3% 


68.9% 


ECO100183 


BAN113438


54% 


90.9% 


69.6% 


ECO100183 


BFR12457 


51% 


88.4% 


87.6% 


ECO100183 


BPT101141 


58% 


93.4% 


92.0% 


ECO100183 


BBU100046 


37% 


89.4% 


98.3% 


ECO100183 


BCE104245


60% 


94.4% 


87.4% 


ECO100183 


BFU105066 


58% 


92.4% 


74.2% 


ECO100183 


BMA107189 


60% 


94.4% 


87.4% 


ECO100183 


CJU100010 


38% 


87.9% 


88.5% 


ECO100183 


CPN200642 


41% 


96.0% 


88.8% 


ECO100183 


CTR200293 


46% 


92.4% 


84.3% 


ECO100183 


CAC100322 


41% 


89.4% 


71.7% 


ECO100183 


CBO102656 


41% 


91.9% 


69.1% 


ECO100183 


CDF100890 


49% 


88.9% 


69.0% 


ECO100183 


CDP101132 


45% 


79.8% 


93.6% 


ECO100183 


EBC102552 


89% 


100% 


100% 


ECO100183 


EFA201281 


50% 


91.9% 


71.0% 


ECO100183 


EFM202475 


51% 


91.9% 


72.1% 


ECO100183 


ECO100183 


100% 


100% 


100% 
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Query LocusID 


Homolog LocusID 


Identity 


Query Coverage 


Homolog Coverage 


ECO100183 


HIN101036


72% 


96.5% 


98.0% 


ECO100183


HPY101305 


33% 


92.5%


91.9% 


ECO100183 


KPN301923 


89% 


100% 


99.5% 


ECO100183 


LPN101418 


62% 


90.9% 


93.2%


ECO100183 


LMO102644 


47% 


91.9% 


69.3% 


ECO100183 


MCA101524 


38% 


90.9% 


85.8% 


ECO100183 


MAV102888 


46% 


91.4% 


76.6% 


ECO100183 


MBV102296 


48% 


91.4% 


69.3% 


ECO100183 


MLP100982 


48% 


91.4% 


75.4% 


ECO100183 


MTU202864 


48% 


91.4% 


69.3% 


ECO100183 


NGO100782 


61% 


92.9% 


95.4% 


ECO100183 


NME200070 


61% 


92.9% 


95.4% 


ECO100183 


PMU101998 


72% 


96.5% 


97.5% 


ECO100183 


PRT101243 


75% 


98.5% 


99.5% 


ECO100183 


PAE203640 


71% 


94.4% 


93.0% 


ECO100183 


PPU101167 


72% 


92.4% 


88.4% 


ECO100183 


PSY106718 


71% 


92.4% 


82.4% 


ECO100183 


STY103913 


92% 


100% 


100% 


ECO100183


STM103626 


92% 


100% 


100% 


ECO100183 


SAU801244 


44% 


92.9% 


71.8% 


ECO100183 


SEP201538 


45% 


91.4% 


69.2%


ECO100183 


SHA101009 


43% 


91.4% 


70.6% 


ECO100183 


SMU100101 


45% 


93.4% 


71.9% 


ECO100183 


SPN401044 


49% 


91.4% 


69.5% 


ECO100183 


SPY200888 


46% 


90.9% 


69.2%


ECO100183 


UUR100400 


35% 


38.9% 


25.3% 


ECO100183 


VCH102212 


73% 


93.9% 


90.3% 


ECO100183 


YPS001153 


84% 


97.5% 


97.5% 


ECO100184 


ABA101411 


48% 


99.6% 


96.4% 


ECO100184 


BAN112326


36% 


99.2% 


99.5% 


ECO100184 


BFR102018 


33% 


96.8% 


95.0% 


ECO100184 


BPT101552 


48% 


99.7% 


99.3%


ECO100184 


BBU100578 


38% 


99.2% 


98.1% 


ECO100184


BCE115231


49% 


99.9% 


100% 


ECO100184 


BFU107633 


47% 


99.9% 


99.7% 


ECO100184 


BMA101634 


48% 


99.9% 


100% 


ECO100184 


CJU100668 


39% 


89.9% 


88.7% 


ECO100184


CPN200079 


37% 


92.5% 


89.8% 


ECO100184 


CTR200821 


36% 


92.5% 


90.0% 


ECO100184 


CAC103161 


39% 


90.5% 


91.0% 


ECO100184 


CBO100607 


34% 


99.7% 


99.8% 


ECO100184


CBO101196 


35% 


99.6% 


99.2% 


ECO100184 


CDF101163 


40% 


92.2% 


91.0% 


ECO100184 


CDP101233 


34% 


98.8% 


99.1% 


ECO100184 


EBC102553 


94% 


100% 


100% 


ECO100184 


EFA202115


32% 


96.2% 


96.7% 


ECO100184 


EFM200294 


32% 


99.1% 


99.5% 


ECO100184 


ECO100184


100% 


100% 


100% 


ECO100184 


HIN100720 


71% 


99.9% 


99.8% 


ECO100184 


HPY101439


39% 


92.5% 


91.3% 


ECO100184 


KPN301922


95% 


100% 


100% 


ECO100184 


LPN102982 


56% 


99.6% 


99.7% 


ECO100184 


LMO100629 


35% 


99.4% 


99.8% 


ECO100184 


MCA102321 


49% 


89.4% 


86.7% 
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Query Locus ID 


Homolog LocusID


Identity 


Query Coverage 


Homolog Coverage 


ECO100184 


MAV100203 


34% 


7.7% 


90.7% 


ECO100184 


MAV103371 


35% 


99.2% 


99.5% 


ECO100184


MBV102577 


35% 


97.8% 


99.1% 


ECO100184 


MLP100747 


35% 


99.2% 


99.5% 


ECO100184 


MTU201526 


36% 


98.8% 


98.7% 


ECO100184


MGE100266 


27% 


72.6% 


89.6% 


ECO100184 


MPN100459 


36% 


18.4% 


19.4% 


ECO100184


NGO101860 


47% 


99.9% 


100% 


ECO100184 


NME200585 


47% 


99.9% 


100% 


ECO100184 


PMU100034 


71% 


99.9% 


99.8% 


ECO100184 


PRT101242 


85% 


98.6% 


100% 


ECO100184 


PAE203638 


58% 


99.5% 


99.6% 


ECO100184 


PPU100137 


58% 


99.9% 


99.9% 


ECO100184 


PSY104339 


57% 


99.5% 


99.6% 


ECO100184 


SPA100912 


92% 


99.3% 


100% 


ECO100184 


STY103914 


96% 


100% 


100% 


ECO100184 


STM103627 


96% 


100% 


100% 


ECO100184 


SAU801703 


33% 


93.0% 


93.8% 


ECO100184 


SEP201707 


33% 


92.0% 


92.4% 


ECO100184 


SHA100359 


33% 


96.0% 


94.2% 


ECO100184 


SMU101018 


30% 


90.8% 


92.0% 


ECO100184 


SPN400795 


31% 


95.3% 


95.0% 


ECO100184 


SPY200987 


29% 


96.2% 


96.8% 


ECO100184


TPA100661 


39% 


92.8% 


90.7% 


ECO100184 


UUR100419 


29% 


87.0% 


94.4% 


ECO100184 


VCH102211 


76% 


100% 


99.6% 


ECO100184


YPS001155 


88% 


100% 


99.1% 


ECO100185


ABA100308 


57% 


80.9% 


94.1% 


ECO100185 


BAN105768 


55% 


97.8% 


95.4% 


ECO100185 


BPT101398 


64% 


99.4% 


97.8% 


ECO100185 


BCE106451 


64% 


99.4% 


97.2% 


ECO100185 


BFU100947 


64% 


99.4% 


97.2% 


ECO100185 


BMA107589 


64% 


99.4% 


97.2% 


ECO100185 


CJU100411 


49% 


98.4% 


97.8% 


ECO100185 


CPN200335 


44% 


98.7% 


96.3% 


ECO100185 


CTR200529 


45% 


98.7% 


96.3% 


ECO100185 


CAC100492 


52% 


81.8% 


95.3% 


ECO100185 


CBO102220 


51% 


89.3% 


50.3% 


ECO100185 


CDF101403 


50% 


96.6% 


97.1% 


ECO100185


EBC102554 


96% 


100% 


100% 


ECO100185 


EFA200247 


52% 


79.6% 


96.9% 


ECO100185


EFM200190 


55% 


81.5% 


99.6% 


ECO100185 


ECO100185 


100% 


100% 


100% 


ECO100185 


HIN100386 


74% 


99.7% 


99.7% 


ECO100185 


HPY100550 


48% 


98.1% 


96.8% 


ECO100185 


KPN301921 


96% 


100% 


100% 


ECO100185 


LPN101352 


62% 


84.0% 


99.3% 


ECO100185 


LMO100159 


52% 


98.4% 


97.8% 


ECO100185 


MCA100190 


60% 


80.3% 


100% 


ECO100185


MLP100072


28% 


46.4% 


27.9% 


ECO100185 


NGO100369 


64% 


98.1% 


97.2% 


ECO100185 


NME201237 


64% 


98.1% 


97.2% 


ECO100185 


PMU100292


75% 


99.7% 


99.7% 


ECO100185 


PRT100667 


69% 


99.7% 


99.7% 
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Query Locus ID 


Homolog LocusID 


Identity 


Query Coverage


Homolog Coverage 


ECO100185 


PRT101241 


88% 


99.7% 


99.7% 


ECO100185 


PAE203637 


69% 


99.4% 


99.4% 


ECO100185 


PPU101171 


67% 


99.4% 


99.7% 


ECO100185 


PSY108491 


67% 


99.4% 


99.1% 


ECO100185 


SPA100816 


80% 


88.4% 


100% 


ECO100185 


STY103916 


97% 


100% 


100% 


ECO100185 


STM103628 


97% 


100% 


100% 


ECO100185 


SAU801700 


52% 


97.8% 


98.4% 


ECO100185 


SEP201704 


52% 


97.8% 


98.4% 


ECO100185 


SHA100923 


52% 


97.8% 


98.1%


ECO100185 


SMU100529 


54% 


79.9% 


98.0% 


ECO100185 


SPN400387 


50% 


79.9% 


98.4% 


ECO100185


SPY201342 


52% 


79.9% 


98.0% 


ECO100185 


VCH102210 


75% 


99.7% 


99.7% 


ECO100185 


YPS001157 


90% 


99.7% 


99.7% 


ECO100193 


CJU100189 


40% 


16.1% 


23.4% 


ECO100193 


ECO100193 


100% 


100% 


100% 


ECO100193 


KPN300059 


73% 


55.8% 


100% 


ECO100193


KPN301914 


66% 


88.7% 


92.5% 


ECO100193


TPA100217 


26% 


29.1% 


82.4% 


ECO100193 


YPS000139 


43% 


83.6% 


84.2% 


ECO100194 


ABA100529 


68% 


99.3% 


100% 


ECO100194 


BAN100354 


47% 


99.0% 


99.3% 


ECO100194 


BAN103931 


51% 


99.8% 


100% 


ECO100194 


BFR100495 


28% 


33.2% 


0.2% 


ECO100194 


BPT102439 


60% 


96.7% 


96.5% 


ECO100194 


BBU100401 


28% 


87.1% 


65.4% 


ECO100194 


BCE111561


67% 


38.6% 


99.5%


ECO100194 


BFU106495 


56% 


99.8% 


99.1% 


ECO100194


BMA104091 


56% 


99.8% 


99.1% 


ECO100194


CJU100505 


45% 


99.7% 


99.6% 


ECO100194 


CPN200251 


43% 


97.4% 


97.5% 


ECO100194 


CTR200662 


39% 


99.7% 


97.9% 


ECO100194 


CAC100467 


49% 


98.4% 


97.4% 


ECO100194 


CBO100123 


28% 


98.6% 


98.3% 


ECO100194 


CDF100404 


47% 


12.8% 


76.6% 


ECO100194 


CDF100250


47% 


98.6% 


97.5% 


ECO100194 


CDP100483 


40% 


99.5% 


99.1% 


ECO100194 


EBC102546 


93% 


99.8% 


100% 


ECO100194 


EFA200454 


47% 


96.7% 


95.8% 


ECO100194 


EFM201684 


47% 


99.0% 


98.6% 


ECO100194 


ECO100194 


100% 


100% 


100% 


ECO100194 


HIN100709 


75% 


99.7% 


99.5% 


ECO100194 


HPY100234


39% 


99.8% 


99.7% 


ECO100194 


KPN301911 


94% 


96.0% 


100% 


ECO100194 


LPN102483 


71% 


16.4% 


100% 


ECO100194 


LMO100624 


49% 


99.0% 


98.8% 


ECO100194 


MCA101839 


60%


98.4% 


99.5% 


ECO100194 


MAV102826


40% 


97.4% 


97.6% 


ECO100194 


MBV103229 


40% 


97.4% 


97.6% 


ECO100194 


MLP100951 


27% 


93.0% 


73.1% 


ECO100194 


MTU202807 


40% 


97.7% 


97.3% 


ECO100194 


NGO100106


60% 


98.4% 


98.4% 


ECO100194 


NME201412 


60% 


98.4% 


98.4% 
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Query Locus ID 


Homolog LocusID 


Identity 


Query Coverage 


Homolog Coverage 


ECO100194 


PMU101370 


76% 


99.0% 


98.9% 


ECO100194 


PRT101177 


79% 


99.7% 


99.8% 


ECO100194 


PAE200955 


71% 


99.7% 


99.6% 


ECO100194 


PPU108855 


68% 


96.5% 


99.6% 


ECO100194 


PSY105509 


67% 


99.8% 


99.8% 


ECO100194 


SPA100809 


89% 


98.4% 


100% 


ECO100194 


STY103990 


95% 


100% 


100% 


ECO100194 


SAU801263 


44% 


99.5% 


99.1% 


ECO100194 


SEP201556 


46% 


99.8% 


99.5% 


ECO100194 


SHA101150 


47% 


98.3% 


97.9% 


ECO100194 


SMU101082 


40% 


99.0% 


99.2% 


ECO100194 


SPN400243 


42% 


99.0% 


99.0% 


ECO100194 


SPY201510 


41% 


99.0% 


98.9% 


ECO100194 


TPA100158 


40% 


93.7% 


93.7% 


ECO100194 


VCH100859


71% 


99.0% 


99.1% 


ECO100194 


YPS002005 


84% 


99.8% 


99.8% 


ECO100195 


ABA103785 


37% 


94.5% 


93.0% 


ECO100195 


BAN110343


34% 


40% 


58.0% 


ECO100195 


CBO101676 


39% 


39.6% 


91.4% 


ECO100195 


CDF101307 


33% 


43.8% 


65.2% 


ECO100195 


EBC102547 


82% 


100% 


100% 


ECO100195 


ECO100195 


100% 


100% 


100% 


ECO100195 


HIN100489 


51% 


94.5% 


92.9% 


ECO100195 


KPN301908 


85% 


100% 


100% 


ECO100195 


MCA100700 


39% 


74.9% 


93.7% 
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NGO100030 


42% 


98.7% 


98.2% 


ECO100195 


NME200229 


42% 


96.2% 


95.6% 
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PMU101170 


54% 


93.6% 
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PRT101178


62% 


100% 


100% 
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PAE203386 


48% 
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PPU108086 


47% 


90.2%
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PSY104415 


45% 
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95.2%
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SPA100206 


89% 


37.9% 


100% 
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88% 


100% 
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VCH100860 


52% 


98.3% 


99.1% 
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71% 


100% 
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34% 
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99.2%
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41% 
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44% 
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44% 


94.5% 
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47% 


100% 
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56% 


87.1% 
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50% 
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95.9% 
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54% 


97.0% 
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37% 
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40% 
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33% 
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100% 


100%


ECO100197 
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47% 
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47% 
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BPT103178 


52% 


100% 


100% 


ECO100198 


BCE112661


54% 


98.2% 


100% 


ECO100198 


BFU102065 


54% 


100% 
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ECO100198 


BMA106456 


54% 


98.2% 


100% 
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CJU100712 


44% 


89.4% 
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CPN200472 


36% 


95.9% 


94.1% 
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43% 


96.3% 


95.9% 
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48% 
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94.9% 
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100% 
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95% 
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100% 
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96.3% 
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ECO100198 


NME200470 


43% 
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ECO100198 


PMU101729 


49% 


95.4% 


90.8% 


ECO100198 


PRT100130 


83% 


100% 


100% 
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PAE202349 


48% 


98.6% 


98.6% 


ECO100198 


PPU109980 


50% 


97.7% 


100% 
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PSY103687


48% 


98.2% 


94.7% 
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SPA100613 


90% 


85.3% 


100% 


ECO100198


STY103994 


92% 


97.7% 


100% 


ECO100198 


STM100618 


44% 


93.5% 


92.7% 


ECO100198 


SAU800463 


50% 


95.9% 


95.0% 
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50% 


95.9% 
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ECO100198 
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40% 
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ECO100198 
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41% 


94.5% 
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SPY200235 


42% 


94.5% 


92.7% 


ECO100198 


TPA100118 


40% 


95.4% 


95.0% 


ECO100198 


VCH100890 


53% 


96.8% 


93.3% 


ECO100198 


YPS002016 


90% 


100% 


100% 


ECO100201 


ABA104598 


65% 


100% 


100% 


ECO100201 


BPT102715 


65% 


100% 


100% 


ECO100201 


BCE100085 


66% 


99.3% 


93.6% 


ECO100201 


EBC100779 


90% 


85.4% 


97.9% 


ECO100201 


ECO100201 


100% 


100% 


100% 


ECO100201 


KPN301057 


90% 


100% 


100% 


ECO100201 


PRT104732


81% 


100% 


100% 


ECO100201 
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65% 


99.3% 


97.4% 
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67% 


100% 


100% 
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SPA101642 


90% 
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STY104030 


91% 


100% 


100% 
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STM103710 


92% 


100% 
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BAN108547
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BAN105487


31% 


44.4% 
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ECO100223 


100% 


100% 


100% 


ECO100223


VCH100877 


26% 


64.8% 


54.4% 


ECO100223


YPS002852 


49% 


94.3% 


84.2% 


ECO100236


ABA101495 


43% 


99.5% 


97.9% 


ECO 100236 


BAN 105079 


42% 


96.2% 


95.7% 


ECO100236 


BAN 100068 


43% 


98.6% 


98.1% 


ECO 100236 


BFR101900 


41% 


97.1% 


96.9% 


ECO100236 


BPT102346 


44% 


99.8% 


98.3% 


ECO 100236 


BCE107136 


45% 


98.3% 


89.0% 


ECO100236 


BMA109567 


45% 


98.3% 


89.7% 


ECO 100236 


CJU100520


38% 


97.6% 


97.8% 


ECO 100236 


CAC 102032 


47% 


97.8% 


96.7% 


ECO 100236 


CDP101216 


45% 


96.2% 


93.5% 


ECO100236


EBC103210 


88% 


100% 


100% 


ECO100236


EFA200208 


46% 


99.8% 


99.3% 


ECO100236 


EFM200382 


46% 


99.5% 


99.0% 


ECO100236 


ECO100236 


100% 


100% 


100% 


ECO100236 


HIN101211 


62% 


99.8% 


99.8% 


ECO100236 


KPN302476 


86% 


100% 


100% 


ECO100236 


LPN100519 


40% 


98.1% 


97.4% 
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ECO 100236 


LMO101153 


46% 


99.5% 


99.0% 


ECO 100236 


MAV103439 


48% 


96.9% 


90.1% 


ECO100236


MBV104414 


47% 


96.6% 


93.3% 


ECO100236


MLP100899 


45% 


97.8% 


96.8% 


ECO100236


MTU202391 


47% 


96.6% 


93.3% 


ECO100236


NGO101136 


44% 


97.8% 


97.4% 


ECO 100236 


NME201166 


45% 


97.8% 


97.4% 


ECO100236


PMU100936


65% 


99.8% 


99.0% 


ECO100236


PRT105196 


71% 


99.8% 


99.8% 


ECO100236


PAE204004 


48% 


99.5% 


97.9% 


ECO100236


PPU112135


43% 


96.2%


98.5% 


ECO100236


PSY105063 


45% 


99.5% 


97.4% 


ECO100236


SPA104265 


83% 


100% 


100% 


ECO100236 


STY 104449 


86% 


100% 


100% 


ECO100236


STM103861 


86% 


100% 


100% 


ECO 100236 


SMU101435 


48% 


99.8% 


99.3% 


ECO 100236 


SPN400833 


46% 


97.4% 


97.1% 


ECO 100236 


SPY201284 


47% 


99.5% 


99.0% 


ECO 100236 


TPA100346 


44% 


97.6% 


97.9% 


ECO100236 


VCH102239 


58% 


99.8% 


99.3%


ECO100236 


YPS002651 


72% 


100% 


99.5% 


ECO 100239 


BFU 102395 


50% 


89.2% 


65.3% 


ECO 100239 


EBC103227 


98% 


63.9% 


100% 


ECO100239


ECO102590 


81% 


90.5% 


89.4% 


ECO 100239 


ECO101961 


81% 


93.7% 


100% 


ECO 100239 


ECO100239


100% 


100% 


100% 


ECO100239


LPN103162 


46% 


66.5% 


95.5% 


ECO 100239 


SEP200678 


37% 


43.7% 


65.7% 


ECO 100240 


EBC103228 


85% 


100% 


72.0% 


ECO 100240 


ECO100240


100% 


100% 


100% 


ECO 100240 


KPN202940 


29% 


86.8% 


94.4% 


ECO 100240 


KPN204064 


29% 


87.5%


95.7% 


ECO 100240 


KPN200664 


29% 


86.8% 


94.4% 


ECO100240


PRT 102844 


43% 


73.7% 


100% 


ECO 100245 


EBC103236 


92% 


99.7% 


100% 


ECO 100245 


ECO100245


100% 


100% 


100% 


ECO100245 


NME200125 


22% 


84.0% 


81.8% 


ECO100255 


ABA104682 


35% 


80.8% 


32.0% 
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ECO100255 


100% 
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100% 
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HIN1 00123 


69%


83.3% 


28.0% 


ECO 100255 
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PMU100956 


75% 


85.8% 


25.5% 


ECO100255 


PRT102521


90% 


85.8% 


26.2% 


ECO100255 


VCH103408 


81% 


85.8% 


25% 


ECO 100256 


EBC101194 


98% 


31.7% 


100% 


ECO 100256 


EBC101311 


92% 


47.3% 


98.8% 


ECO 100256 


EBC103715 


88% 


58.1% 


100% 


ECO 100256 


EBC 103943 


90% 


74.9% 


100% 


ECO 100256 


EBC104524 


90% 


74.9% 


100% 


ECO100256 


EBC103604 


99% 


100% 


100% 


ECO100256 


ECO100266 


100% 


100% 


100% 


ECO100256 


ECO100256


100% 


100% 


100% 


ECO100256 


KPN302635 


99% 


100% 


100% 


ECO100256 


KPN301602 


99% 


100% 


100% 


ECO100256 


KPN303307 


99% 


100% 


100% 
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KPN306562 


99% 


100% 


100% 




KPN308721


99% 


100%


100% 




KPN 302324 


99% 


100% 


100% 




KPN301774 


99% 


100% 


100% 


ECO100256


KPN301257 


97% 


47.9% 


100% 


ECO100256


KPN300832 


91% 


56.3%


100% 


ECO100256


KPN302225 


100% 


65.9% 


100% 


ECO 100256 


KPN300593 


96% 


98.8% 


100% 


ECO100256


KPN302471 
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100% 


100% 


ECO100256


KPN300357 
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FCO 100256 


KPN300875 
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FPO 100256 


STY100327 
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100% 


100% 
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STY 104877 
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100% 


100% 


ECO100256


STY104954 
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100% 


100% 
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100% 


100% 


100% 


ECO100257


ECO100257 


100% 


100% 


100% 


ECO 100257 
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100% 


100% 


100% 


ECO100257


KPN302325 


100% 


100% 
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ECO 100257 
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100% 


ECO100257


KPN306262 
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FCO 100257 
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98% 
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89.8% 


ECO100257
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KPN305294 
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100% 


100% 


100% 


ECO100257


KPN300876 


100% 


100% 


100% 


ECO100257
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BAN105357 
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BFU1 14123 


36% 
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37% 
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ECO 100298 
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33% 
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100% 


QR ft% 

70.0 /0 


ECO 100298 


NGO101247 


35% 
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NME201508 


35% 


99.6% 


91.5% 
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PMU101853 


73% 


100% 


100% 
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-PRT100774 


72% 


100% 


100% 


ECO100298


PRT101849 


78% 


99.6% 


97.1% 
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FCO 100361 


BMA104829 


46% 


93.7% 


96.4% 


ECO100361


CJU 100923 


51% 


94.3% 


98.2% 


ECO100361


CPN200001 


45% 


92.8% 


94.6% 


ECO100361


CTR200001 


43% 


93.4% 


94.9% 


ECO100361


CAC 102676 


49% 


94.3% 


99.1% 


ECO100361


CBO101832 


48% 


93.4% 


96.9% 


ECO100361


CDF102168 


49% 


94.6% 


98.5% 


ECO100361


CDP 100775 


49% 


94.9% 


98.8% 


ECO100361 


EBC 102698 


93% 


66.9% 


100% 


ECO100361 


ECO100361 


100% 


100% 


100% 


ECO100361 


HPY 100160 


51% 


94.0% 


98.1% 


ECO100361 


KPN308372 


91% 


96.7% 


100% 


ECO100361


LPN101016 


50% 


93.4% 


95.2% 


ECO100361


LMO100837 


51% 


95.2%


98.8% 


ECO100361


MCA101135 


46% 


93.1% 


95.2%




MAV107SR4 


48% 

"TO fQ 


93.4% 


97.2%


ECO100361


MBV102362 


47% 


93.4% 


96.9% 


ECO100361


MLP101436


46% 


93.1% 


96.0% 


ECO100361


MTU200510 


47% 


93.4% 


96.4% 


ECO100361


NGO100861 


43% 


93.4% 


94.1% 


ECO100361


NME200933


44% 


93.4% 


94.1% 


ECO100361


PMU101692


41% 


93.7% 


94.1% 


ECO 100361 


PRT102370


77% 


95.2% 


98.2% 


ECO100361


PAE205238 


40% 


93.7% 


94.7% 


ECO100361


PPU110651


76% 


94.3% 


97.5% 


ECO100361


PSY101277 


42% 


94.0% 


95.2% 


ECO100361


SPA101546 


90% 


85.4% 


99.6% 


ECO100361


STY104591 


93% 


96.7% 


100% 


ECO100361


SAU801668 


49% 


92.8% 


96.9% 


ECO 100361 


SEP201642 


49% 


92.8% 


96.9% 


ECO100361 


SHA101765 


42% 


58.5% 


98.0% 


ECO100361 


VCH100105 


44% 


93.7% 


92.2% 


ECO100361 


YPS000599 


43% 


93.7% 


94.1% 


ECO 100362 


ECO 100362 


100% 


100% 


100% 


ECO 100366 


BFR101178 


24% 


14.9%


57.0% 


ECO 100366 


CJU 100896 


19% 


55.0%


36.6% 


ECO100366


CDP 102949 


27% 


21.6% 


9.1% 


ECO 100366 


EBC 102756 


70% 


96.6% 


100% 


ECO 100366 


ECO100366


100% 


100% 


100% 


ECO100366 


LPN101403 


23% 


25.9% 


39.1% 


ECO100366 


PRT103239 


21% 


51.8% 


36.1% 


ECO100366 


SPA101548 


89% 


70.7% 


100% 


ECO 100366 


STY104592 


89% 


97.0% 


47.2%


ECO100366 


STM104306 


91% 


100% 


47.8% 


ECO100367 


EBC102015 


28% 


77.0% 


100% 


ECO100367 


EBC102758 


51% 


91.0% 


97.6% 



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Query LocusID 


Homolog LocusID 


Identity 


Query Coverage 


Homolog Coverage 


ECO100367 


ECO100367 


100% 


100% 


100% 


ECO100367 


KPN304631 


34% 


78.8% 


100% 


ECO100367


SPA101549 


68% 


98.6% 


100% 


ECO100367 


STY104593 


71% 


93.2% 


100% 


ECO100367 


STM104307 


71% 


93.2% 


100% 


ECO100381 


EBC102823


83% 


96.8% 


96.8% 


ECO100381 


ECO100381 


100% 


100% 


100% 


ECO100381 


KPN304881 


79% 


98.4% 


98.4% 


ECO100381 


STY 100328 


95% 


98.4% 


98.4% 


ECO100381 


STM104352 


93% 


98.4% 


98.4% 


ECO100390 


ABA 100709 


34% 


73.5% 


72.5% 


ECO100390 


BAN111886 


25% 


64.5% 


73.0% 


ECO100390 


BAN113193 


23% 


98.5% 


97.9% 


ECO100390 


BFR11820 


28% 


96% 


99.2% 


ECO100390 


BPT 100403 


35% 


21.2% 


23.5% 


ECO100390 


BBU100828 


26% 


82.8% 


82.6% 


ECO100390 


BCE 105254 


23% 


22.8% 


24.4% 


ECO 100390 


BFU 106002 


34% 


21.5% 


23.4% 


ECO100390 


CAC100402 


26% 


84% 


85.3% 


ECO100390 


CBO100412 


26% 


92.2% 


92.2% 


ECO 100390 


CDF103755 


25% 


89.5% 


92.6% 


ECO 100390 


CDP101084 


24% 


62% 


58.7% 


ECO100390 


EBC101912 


81% 


99.8% 


98.3% 


ECO100390 


EFA200864 


26% 


97.2%


97.9% 


ECO 100390 


EFM200128 


24% 


96% 


99.5% 


ECO100390 


ECO 100390 


100% 


100% 


100% 


ECO100390 


KPN308651 


78% 


99.8% 


99.5% 


ECO100390 


LMO 102080 


27% 


96.8% 


97.6% 


ECO 100390 


MCA101153 


32% 


95.8% 


90.7% 


ECO100390 


MAV100736 


24% 


62% 


58.2% 


ECO100390 


MLP 100687 


27% 


21.2% 


23.0% 


ECO100390 


MTU406882 


27% 


21.2% 


30.2% 


ECO100390


PRT101271 


55% 


96.8% 


96.3% 


ECO100390 


PAE204279 


31% 


92% 


91.4% 


ECO100390 


PPU105701


33% 


92% 


92.2%


ECO100390 


PSY101474 


31% 


99.8% 


99.5% 


ECO100390


SPA101178 


81% 


100% 


98.3% 


ECO100390 


STY100355 


83% 


100% 


100% 


ECO100390


SAU801345 


23% 


97.8% 


98.9% 


ECO100390


SEP202011 


21% 


97.8% 


98.9% 


ECO100390 


SHA100866 


23% 


72.2% 


73.8% 


ECO100390 


SPN201925 


26% 


16% 


16.3% 


ECO100390


TPA100619 


26% 


63.2% 


62.9% 


ECO100390 


VCH103245 


28% 


72.8% 


74.4% 


ECO100390 


YPS002608 


60% 


98.8% 


96.9% 


ECO100394 


ABA104913 


59% 


99.6% 


97.2% 


ECO100394 


BAN112358


49% 


94.7% 


95.8% 


ECO100394


BAN100103


52%


98.2%


97.6% 


ECO100394 


BCE101232 


61% 


94.7% 


100% 


ECO100394 


BMA101593 


62% 


95.2% 


96.0% 


ECO100394 


EBC101456 


91% 


54.9% 


100% 


ECO100394


EFA201477 


33% 


98.7% 


98% 


ECO100394 


ECO100394 


100% 


100% 


100% 


ECO100394 


KPN304910 


93% 


100% 


100% 



WO 02/077183 PCT/US02/09107 



Query LocusID


Homolog LocusID


Identity 


Query Coverage 


Homolog Coverage 


ECO100394


MPN 100528 


22% 


71.8% 


65.7% 


ECO100394


PRT102253 


60% 


96.9% 


97.6% 


ECO100394


PAE205092 


63% 


95.6% 


94.2% 


ECO100394


PPU106756 


60% 


93.2% 


95.3% 


ECO100394


PSY101310 


63% 


96.9% 


97.2% 


ECO100394


SPA101112 


94% 


98.0% 


99.6% 


ECO 100394 


STY 100389 


95% 


99.6% 


99.8% 


ECO100394


STM 100031 


95% 


99.6% 


99.8% 


ECO100394


YPS002596 


81% 


98.7% 


97.4% 


ECO100395


BAN101050 


31% 


73.1% 


70.0% 


ECO100395


BAN109230 


33% 


73.1% 


69.8%


ECO100395


BFR 12473 


26% 


64.0% 


70.0% 


ECO100395


CAC100478 


27% 


64.1% 


82.9% 


ECO 100395 


CDF104139


31% 


74.4% 


71.2%


ECO 100395 


EBC101457 


84% 


79.0%


100% 


ECO 100395 


EFA202423 


30% 


68.6% 


65.8% 


ECO 100395 


EFM200019 


29% 


82.5% 


82.5% 


ECO 100395 


ECO 100395 


100% 


100% 


100% 


ECO 100395 


KPN304911 


79% 


99.5% 


99.5% 


ECO 100395 


LMO100614 


31% 


73.2% 


70.7% 


ECO100395


SPA105253 


92% 


4.5% 


90% 


ECO100395


SPA101113 


81% 


44.1% 


100% 


ECO100395


STY 100390 


83% 


99.8% 


99.8% 


ECO100395


STM 100032 


83% 


99.8% 


99.8% 


ECO100395


SPN400948 


29% 


72.9% 


70.0% 


ECO100395


SPY201005 


32% 


75.0% 


73.0% 


ECO100395


YPS002575 


57% 


99.5% 


99.0% 


ECO100402


BAN107351 


47% 


26.1% 


35.6% 


ECO100402


BFR102271 


29% 


60.9% 


24.5% 


ECO100402


EBC103463


93% 


100% 


100% 


ECO100402


ECO100402 


100% 


100% 


100% 


ECO100402


KPN301340 


90% 


100% 


97.5% 


ECO 100402 


PRT101447 


89% 


100% 


100% 


ECO 100402 


PPU108585 


56% 


88.7% 


79.7% 


ECO 100402 


PSY105486 


56% 


88.7% 


79.0% 


ECO 100402 


SPA103227 


96% 


100% 


100% 


ECO 100402 


STY 100420 


96% 


100% 


100% 


ECO 100402 


STM 100071 


96% 


100% 


100% 


ECO100402


YPS000205 


82% 


98.3% 


100% 


ECO 100404 


EBC101277 


64% 


58.8% 


100% 


ECO 100404 


ECO100404


100% 


100% 


100% 


ECO 100404 


KPN303448 


70% 


88.4% 


97.8% 


ECO100404


SPA103216 


73% 


89.9%


100% 


ECO 100404 


STY 100423 


74% 


89.9% 


100% 


ECO100407


ABA 103638 


62% 


91.7%


91.0%


ECO100407


BAN110020


44% 


96.1%


100% 


ECO100407


BAN104354


54% 


96.2%


97.4%


ECO100407 


BFR104554 


42% 


88.5% 


83.5% 


ECO100407 


BPT100012 


39% 


94.9% 


85.5% 


ECO100407 


BCE102848 


40% 


96.2% 


84.8% 


ECO100407 


BFU108808 


42% 


96.8% 


86.9% 


ECO 100407 


BMA 102272 


41% 


93.6% 


81.5% 


ECO 100407 


CJU100351 


50% 


98.7% 


100% 


ECO 100407 


CPN200977 


27% 


98.7% 


100% 



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Query LocusID 


Homolog LocusID


Identity 


Query Coverage 


Homolog Coverage


ECO100407 


CTR200105 


37% 


99.4% 


no *?o/ 
Vo.7% 


ECO100407 


CAC100789 


52% 


98.7% 


no nn/ 


ECO100407 


CBO101079 


57% 


98.7% 


QO AO/ 


ECO100407 


CDF101258 


55% 


97.4% 


y .>. o/o 


ECO100407 


CDP100290 


42% 


94.2% 


74.3%


ECO100407 


EBC101280 


98% 


100% 


100%


ECO100407 


ECO 100407 


100% 


100% 


100%


ECO100407 


HIN101269 


76% 


100% 


OA *>0/ 


ECO100407 


HPY100002 


50% 


99.4% 


99.4%


ECO100407 


KPN308524 


100% 


100% 


87.2% 


ECO100407 


LPN100108 


43% 


90.4% 


90.3% 


ECO100407 


MCA103693 


60% 


90.4% 


85.4% 


ECO100407 


MAV100814 


44% 


91.7% 


85.9% 


ECO100407 


MBV102145 


44% 


91.7% 


87.5% 


ECO100407 


MLP 100346 


42% 


90.4% 


88.8% 


ECO 100407 


MTU201397 


44% 


91.7% 


90.9% 


ECO 100407 


NGO100984 


45% 


96.8% 


94.3% 


ECO 100407 


NME200821 


44% 


96.8% 


94.3% 


ECO100407 


PMU100731 


73% 


100% 


99.4% 


ECO100407 


PRT100213


83% 


99.4% 


99.4% 


ECO100407 


PAE204050 


56% 


100% 


98.1% 


ECO100407 


PPU104481 


57% 


100% 


98.1% 


ECO100407 


PSY108754 


52% 


100% 


98.1% 


ECO100407 


SPA108738


90% 


100% 


90.2% 


ECO 100407 


STY 100427 


91% 


100% 


100% 


ECO100407 


STM100098 


91% 


100% 


100% 


ECO100407 


SAU801767 


58% 


82.7% 


96.2% 


ECO100407 


SEP202131 


56% 


96.2% 


97.4% 


ECO100407 


SHA101039 


58% 


94.9% 


96.7% 


ECO100407 


SPN400161 


54% 


100% 


100% 


ECO100407 


VCH102234 


65% 


98.7% 


89.0% 


ECO 100407 


YPS002509 


86% 


99.4% 


99.4% 


ECO100408 


ABA101072 


44% 


97.8% 


91.3% 


ECO100408 


BAN105057 


32% 


87.8% 


93.1% 


ECO 100408 


BAN110216


35% 


87.8% 


93.1% 


ECO100408 


BFR105998


27% 


62.6% 


28.6% 


ECO100408 


BPT100014 


45% 


95.0% 


88.3% 


ECO 100408 


BBU100107 


33% 


85.6% 


86.2% 


ECO100408 


BCE111817


43% 


97.8% 


93.8% 


ECO100408 


BFU 113008 


42% 


97.1% 


93.8% 


ECO100408 


BMA102557 


44% 


97.8% 


93.8% 


ECO100408 


CJU100350 


34% 


89.9% 


93.9% 


ECO100408 


CPN200850 


23% 


91.4% 


78.5% 


ECO100408 


CTR200214 


25% 


90.6% 


79.3% 


ECO100408 


CAC103566 


31% 


87.8% 


91.1% 


ECO100408 


CBO100362 


33% 


89.2% 


97.2% 


ECO 100408 


CDF101805 


31% 


66.9% 


55.6% 


ECO 100408 


CDP100352 


32% 


91.4% 


66.8% 


ECO100408 


EBC101281 


93% 


100% 


100% 


ECO100408 


EFA202204 


34% 


87.8% 


81.5% 


ECO100408 


EFM100580


31% 


90.6% 


91.3% 


ECO100408 


ECO100408 


100% 


100% 


100% 


ECO100408 


HIN101270


54%


95.7% 


92.4% 


ECO 100408 


HPY100001


30%


89.9% 


90.6% 






Query LocusID




Identity


Query Coverage


Homolog Coverage


ECO100408


KPN308025 


94% 


100% 


100% 


ECO100408


LPN101606


40% 


93.5% 


88.4% 


ECO100408


LMO102626


37% 


91.4% 


96.1% 


ECO100408


MCA 103036 


34% 


94.2% 


82.3% 


ECO100408


MAV101907 


36% 


87.8% 


80.8% 


ECO100408


MBV101339 


34% 


87.8% 


43.4% 


ECO100408


MLP100320 


35% 


89.2% 


67.4% 


ECO100408


MTU202496


34% 


87.8% 


80.8% 


ECO100408


NGO100987


42% 


96.4%


95.0% 


ECO100408


WMF700K70 


47% 


96.4% 


95.0% 


ECO100408


PMU100710


61% 


95.7% 


92.4% 


ECO100408


PRT101111


74% 


74.8% 


100% 


ECO100408


PAE204049


55% 


96.4% 


84.3% 


ECO100408


PPU111159 


53% 


96.4% 


80.7% 


ECO100408


PSY102195 


51% 


96.4% 


81.2% 


ECO100408


SPA 103662 


94% 


40.3% 


100% 


ECO100408


STY100428 


96% 


100% 


100% 


ECO100408


STM 100099 


96% 


100% 


100% 


ECO100408


AT 1801 524 


34% 


88.5% 


95.3% 


ECO100408


SFP7008Q4 


15% 


88.5%


95.3%


ECO100408




34% 


88.5%


96.1%


ECO100408


9MTI101706 


31% 


89.9%


90.2%


ECO100408


SPN400190


34% 


89.9% 


87.0% 


ECO100408


WV701 108 


10% 


89.9%


88% 


ECO100408


TPA101005 


15% 


90.6%


91.5%


ECO100408


T IT IP 1 00107 


78% 


54.0%


59.8% 


ECO100408


VPH 107711 


53% 


97.1%


95.5%


ECO100408


VP ^007 505 


84% 


99.3%


100% 


ECO100409


ABA105144


4fi% 


98.2%


99.3%


ECO100409


RFB 1 1 1 77 


10% 


97.5%


95.9%


ECO100409


BPT100016 


43% 


98.2%


93.8% 


ECO100409


RPF1 101 70 


51% 


57% 


85% 


ECO100409


BFU 102731 


49% 


97.8%


97.3%


ECO 100409 


BMA 107273 


44% 


96.9%


99.1%


ECO 100409 


CJU101376 


27% 


82.5% 


88.6% 


ECO 100409 


CDP 100279 


31% 


74.8% 


72.3% 


ECO 100409 


EBC101282 


84% 


99.4% 


100% 


ECO 100409 


ECO 100409 


100% 


100% 


100% 


ECO100409


VTTN101771 


57% 


99.1%


01 9% 


ECO 100409 


1CPN101417 


84% 


99.4%


100% 


ECO100409


T PNJ101777 


59% 


87.7%


81.1%


ECO100409


MPA10**£^7 


1Q<^ 


01 7% 


77. / /o 


ECO100409


MAV10141Q 


JJ /o 


87 80/. 


87.4%


ECO100409


MR VI 07040 


JO/0 


87.8%


81.4%


ECO100409


MT PI 01 077 


15% 


81.8%


87.5%


ECO100409


MTT 17 07 010 


16% 


87.8%


81.4%


ECO 100409 


NGO 100*05 


47% 


97.8%


99.1%


ECO100409 


NME201944 


43% 


97.8% 


99.1% 


ECO100409 


PMU100729 


53% 


99.1% 


99.7% 


ECO100409 


PRT100038


67% 


99.1% 


98.8% 


ECO100409 


PAE204048 


45% 


98.8% 


98.8% 


ECO 100409 


PPU104483 


45% 


98.2% 


98.1% 


ECO 100409 


PSY102181 


47% 


99.1% 


99.4% 


ECO100409 


SPA103660 


87% 


100% 


100% 
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Homnlotr LocUSlD 


Identity 


Query Coverage 


Homolog Coverage 


ECO100409


STY 100439 


87% 


100% 


100% 


ECO 100409 


STM100100 


88% 


100% 


100% 


ECO 100409 


VCH 102232 


54% 


99.1% 


96.1% 


ECO 100409 


YPS002502 


75% 


99.7% 


98.5% 


ECO100424


ABA 103655 


43% 


93.7% 


96.9% 


ECO 100424 


BAN111570


28% 


72.4% 


88.2% 


ECO100424


BAN 110867 


33% 


85.1% 


92.4% 


ECO 100424 


BPT102774


55% 


85.7% 


88.8% 


ECO 100424 


BCE104834


59% 


78.1% 


93.5% 


ECO100424 


BFU101934 


57% 


83.2% 


93.5% 


ECO 100424 


BMA107732 


57% 


88.6% 


92.6% 


ECO 100424 


BMA109512 


56% 


89.5% 


97.2% 


ECO 100424 


CDP100360 


28% 


24.8% 


21.0% 


ECO 100424 


EBC101217 


95% 


100% 


100% 


ECO 100424 


ECO 100424 


100% 


100% 


100% 


ECO100424 


KPN303418 


85% 


47.9% 


100% 


ECO100424 


LMO 100640 


34% 


94.0% 


81.0% 


ECO 100424 


MAV102952 


26% 


39.0% 


37.5% 


ECO100424 


MBV101465 


22% 


42.9% 


27.2% 


ECO100424


MTU202140 


22% 


42.9% 


25% 


ECO100424


PRT 103954 


67% 


98.4% 


98.4% 


ECO100424


PAE201316 


57% 


95.2% 


90.0% 


ECO 100424 


PPU 101498 


57% 


97.5% 


99.7% 


ECO100424


PSY 104255 


56% 


96.5% 


96.5% 


ECO 100424 


SPA102308 


76% 


100% 


100% 


ECO100424


STY 100721 


95% 


100% 


99.1% 


ECO100424


SAU801061


35% 


81.0% 


68.0% 


ECO100424




38% 


81.0% 


66.6% 


ECO 100424 


SHA 100551 


40% 


58.4% 


59.5% 


ECO 100424 


YPS001897 


66% 


96.5% 


99.7% 


ECO100430


ABA101466 


71% 


93.6% 


91.3% 


ECO100430 


BAN100418 


59% 


22.6% 


96.9% 


ECO 100430 


BAN 109620 


38% 


94.3% 


97.8% 


ECO 100430 


BAN 107278 


65% 


68.2% 


93.4% 


ECO 100430 


BAN111361 


61% 


96.5% 


96.4% 


ECO100430 


BFR100135 


54% 


86.1% 


94.8% 


ECO 100430 


BPT100486


71% 


100% 


93.2% 


ECO 100430 


BBU 100611 


53% 


96.7% 


95.8% 


ECO 100430 


BCE 114986 


74% 


97.4% 


97.6% 


ECO100430 


BFU 100995 


73% 


97.4% 


97.6% 


ECO100430 


BMA101609 


73% 


97.4% 


97.6% 


ECO 100430 


CJU100247 


55% 


94.6% 


96.2% 


ECO100430 


CPN201004 


57% 


96.0% 


98.3% 


ECO100430


CTR200078 


57% 


96.0% 


98.3% 


ECO 100430 


CAC101555 


61% 


96.9% 


94.7% 


ECO 100430 


CBO 100837 


62% 


97.4% 


93.7% 


ECO 100430 


CDF102979


61% 


96.5% 


97.8% 


ECO 100430 


CDP101237 


61% 


96.2% 


96.7% 


ECO100430 


EBC101545 


95% 


98.3% 


100% 


ECO 100430 


EFA202273 


60% 


95.5% 


96.6% 


ECO100430 


EFM101238 


60% 


96.5% 


97.6% 


ECO100430 


ECO 100430 


100% 


100% 


100% 


ECO100430 


HIN100694


69% 


97.4% 


100% 


ECO 100430 


HPY101354 


51% 


94.3% 


97.1% 
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Query LocusID 


Homolog LocusID 


Identity 


Query Coverage 


Homolog Coverage 


ECO100430 


KPN301083 


98% 


100% 


100% 


ECO100430 


LMO102362 


62% 


96.2% 


96.2%


ECO100430 


MCA101655 


65% 


93.4% 


97.0% 


ECO100430 


MAV102304 


62% 


96.0% 


96.0% 


ECO100430 


MBV101048 


61% 


96.9% 


96.9% 


ECO100430 


MLP100910 


61% 


96.2% 


96.2% 


ECO 100430 


MTU202421 


61% 


96.9% 


96.9% 


ECO100430 


NGO101085 


69% 


94.1% 


97.8% 


ECO 100430 


NME201444 


69% 


94.1% 


97.8% 


ECO100430 


PMU101977 


73% 


94.6% 


98.1% 


ECO100430 


PRT101501 


88% 


100% 


100% 


ECO100430 


PAE201801 


77% 


98.3% 


97.9% 


ECO100430 


PPU109917 


77% 


98.8% 


98.4% 


ECO100430 


PSY107890 


77% 


98.3% 


97.9% 


ECO100430 


SPA 102998 


97% 


98.8% 


99.8% 


ECO100430 


STY100725 


97% 


100% 


100% 


ECO100430 


STM100438 


98% 


100% 


100% 


ECO100430 


SAU801674 


61% 


94.1% 


94.0% 


ECO100430 


SEP201649 


61% 


94.1% 


94.0% 


ECO100430 


SHA100379 


56% 


44.8% 


92.5% 


ECO100430 


SMU100958


59% 


94.1% 


96.6% 


ECO100430


SPN401426 


58% 


96.7% 


98.8% 


ECO100430


SPY200660 


59% 


94.1% 


96.8% 


ECO100430 


TPA100504 


56% 


93.6% 


95.4% 


ECO100430 


VCH101891 


84% 


100% 


100% 


ECO100430 


YPS001881 


92% 


100% 


100% 


ECO100431 


ABA 100033 


61% 


24.2% 


90.3% 


ECO100431 


ABA105304 


55% 


97.6% 


95.2% 


ECO 100431 


BAN 104528 


44% 


96.5%


99.6% 


ECO100431 


BAN103992 


53% 


97.7% 


96.3% 


ECO100431 


BFR11925 


45% 


97.4% 


94.3% 


ECO100431 


BPT100489 


68% 


98.2% 


95.5% 


ECO100431 


BBU100612 


39% 


97.7% 


97.3% 


ECO100431 


BCE108504 


69% 


98.2% 


95.3% 


ECO100431 


BFU100993 


70% 


98.5% 


95.5% 


ECO100431 


BMA107557 


69% 


99.4% 


96.6% 


ECO100431 


CJU100999 


39% 


97.1% 


98.4% 


ECO100431 


CPN200737 


40% 


97.8% 


95.1% 


ECO100431 


CTR200613 


40% 


97.1% 


94.6% 


ECO100431 


CAC100342 


47% 


97.1% 


98.1% 


ECO100431 


CBO101096 


50% 


98.0% 


99.6% 


ECO100431 


CDF102998 


52% 


97.2% 


97.3% 


ECO100431 


EBC101544 


98% 


54.5% 


100% 


ECO100431 


ECO100431 


100% 


100% 


100% 


ECO100431 


HIN100442 


74% 


98.3% 


96.3% 


ECO 100431 


HPY101359


39% 


97.1% 


98.9% 


ECO100431 


KPN301084 


96% 


62.6% 


100% 


ECO100431 


MCA100771 


52% 


98.9% 


96.8% 


ECO100431 


MAV107390 


50% 


47.6% 


95.6% 


ECO100431 


MLP100403 


40% 


8.2% 


19.4% 


ECO100431 


MGE100244 


41% 


96.2% 


98.2%


ECO100431 


MPN100504 


40% 


96.2% 


98.2%


ECO100431 


NGO100249 


65% 


98.1% 


95.5% 


ECO100431 


NME201277 


65% 


98.9% 


97.0% 
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Homolog LocusID 


Identity 


Query Coverage 


Homolog Coverage


ECO100431 


PMU101978 


75% 


99.4% 


96.8% 


ECO100431 


PRT101500 


88% 


100% 


100% 


ECO 100431 


PAE201802 


70% 


97.8% 


96.0% 


ECO100431 


PPU106533 


70% 


97.8% 


96.0% 


ECO100431 


PSY103927 


69% 


97.8% 


96.0% 


ECO100431 


SPA102997 


99% 


100% 


98.4% 


ECO100431 


STY100726 


99% 


100% 


100% 


ECO100431 


STM100439 


99% 


100% 


100% 


ECO100431 


SPN401780 


25% 


15.9% 


41.4% 


ECO100431 


SPY101613 


24%


13.8% 


42.9% 


ECO100431 


TPA100519 


43% 


98.3% 


87.7% 


ECO100431 


UUR100350 


40% 


96.7% 


99.0% 


ECO100431 


VCH101890 


82% 


100% 


99.6% 


ECO100431 


YPS001875 


91% 


100% 


100% 


ECO100435 


ABA106138 


24% 


96.2% 


84.2% 


ECO100435 


BFR102134 


27% 


79.5% 


78.4% 


ECO100435 


BFU104105


26% 


95.5% 


83.9% 


ECO100435 


CAC102936 


39% 


55.3% 


54.1% 


ECO100435 


EBC102273 


87% 


99.2% 


99.2% 


ECO100435 


ECO100435 


100% 


100% 


100% 


ECO100435 


KPN302385


85% 


98.5% 


97.7% 


ECO100435 


LPN 102857 


30% 


85.6% 


83.1% 


ECO100435 


MCA100454 


24% 


78.8% 


64.6% 


ECO100435 


MAV107513 


22% 


97.0% 


94.2% 


ECO100435 


MBV101527 


26% 


96.2% 


92.8% 


ECO100435 


MTU202439 


26% 


96.2% 


92.8% 


ECO100435 


NGO101290 


40% 


96.2% 


97.6% 


ECO100435 


NME200457 


40% 


96.2% 


97.6% 


ECO100435 


PRT106164 


55% 


97.0% 


94.8% 


ECO100435 


SPA102993 


93% 


100% 


100% 


ECO100435 


STY100750 


94% 


100% 


100% 


ECO100435 


STM100443 


94% 


100% 


100% 


ECO100435 


SPN401264 


26% 


75.8% 


33.1% 


ECO100435 


TPA100154 


37% 


90.2% 


88.8% 


ECO100435 


YPS001867 


69% 


99.2% 


97.0% 


ECO 100445 


BAN103966 


24% 


61.1% 


4.8% 


ECO100445 


BAN105784 


24% 


61.1% 


43.0% 


ECO100445 


BMA103371 


31% 


43.7% 


23.7% 


ECO100445 


CJU101094 


30% 


44.2% 


93.3% 


ECO100445 


EBC101949 


75% 


81.6% 


98.1% 


ECO100445 


ECO100445 


100% 


100% 


100% 


ECO100445 


KPN302393 


70% 


84.2% 


98.8% 


ECO100445 


MAV100663 


25% 


55.3% 


53.0% 


ECO100445 


PAE203672 


38% 


54.2% 


77.3% 


ECO100445 


PPU105141 


40% 


52.1% 


74.2% 


ECO100445 


PSY107733 


35% 


53.2% 


98.1% 


ECO100445 


SPA103161 


84% 


100% 


100% 


ECO100445 


STY 100781 


88% 


100%


100% 


ECO100445 


STM100473 


88% 


100% 


100% 


ECO100445 


VCH101045 


43% 


47.9% 


63.6% 


ECO100445


YPS001806 


55% 


87.4% 


94.2% 


ECO100448 


ECO100448 


100% 


100% 


100% 


ECO100448 


KPN302401 


62% 


48.5% 


100% 


ECO100448 


PAE202198 


27% 


94.8% 


95.1% 
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Identity 


Query Coverage 


Homolog Coverage 


ECO100448 


PPU101997 


30% 


97.1% 


93.8%


ECO100448 


SPA103158 


77% 


49.2% 


100% 


ECO100448 


STY100783 


67% 


99.2% 


99.6% 


ECO100453 


ABA101792 


57% 


99.1% 


99.3% 


ECO100453 


BPT100257 


67% 


98.8% 


97.6% 


ECO 100453 


BCE114545


66% 


24.5% 


84.0% 


ECO 100453 


BCE104032 


66% 


99.5% 


98.6% 


ECO100453 


BFU105569 


65% 


100% 


98.2% 


ECO100453 


BMA106830 


66% 


99.9% 


98.2% 


ECO100453 


EBC103991 


92% 


100% 


100% 


ECO100453 


ECO 100453 


100% 


100% 


100% 


ECO100453 


HIN 100875 


32% 


97.9% 


97.4% 


ECO100453 


KPN303893 


91% 


100% 


100% 


ECO100453 


LPN101626 


40% 


98.9% 


99.4% 


ECO100453 


MCA100461 


56% 


99.9% 


99.7% 


ECO100453 


MAV103904 


24% 


18.5% 


20.4% 


ECO100453 


MBV105073 


23% 


19.9% 


19.1% 


ECO100453


MTU201501 


23% 


19.5% 


16.7% 


ECO100453 


NGO100607 


48% 


99.9% 


98.7% 


ECO100453 


NME201818 


49% 


99.9% 


98.7% 


ECO100453 


PMU101132 


31% 


97.6% 


97.1% 


ECO100453 


PRT104879 


75% 


99.3% 


99.2% 


ECO100453 


PAE200425 


70% 


98.5% 


98.6% 


ECO100453 


PPU101135 


65% 


99.0% 


98.7% 


ECO100453 


PSY103933 


65% 


99.3% 


99.5% 


ECO100453 


STY100789 


94% 


99.9% 


99.9% 


ECO100453


YPS001789 


84% 


99.8% 


99.6% 


ECO100456 


BCE114237


32% 


23.7% 


57.2%


ECO100456 


BFU101219 


28% 


27.1% 


67.2% 


ECO100456 


CPN200570 


19% 


22.7% 


58.7% 


ECO100456 


CTR100201


19% 


37.5% 


48.0% 


ECO100456 


CDP102443 


27% 


12.8% 


29.9% 


ECO100456 


EBC103983 


84% 


97.2% 


99.9% 


ECO100456 


ECO100456 


100% 


100% 


100% 


ECO100456 


HIN100185


35% 


97.2% 


98.4% 


ECO100456


KPN300148 


83% 


14.6% 


100% 


ECO100456 


KPN303886 


83% 


99.7% 


97.9% 


ECO100456 


LPN100506 


21% 


82.6% 


90.8% 


ECO100456 


MLP100103 


19% 


19.0% 


26.4% 


ECO100456 


MPN100367 


22% 


38.7% 


9.5% 


ECO100456 


PMU100358 


38% 


97.7% 


98.5% 


ECO100456 


PRT105555 


48% 


97.3% 


98.3% 


ECO100456 


PAE205017 


39% 


97.1% 


97.0% 


ECO100456 


PPU102036 


40% 


97.4% 


98.5% 


ECO100456 


PSY103682 


41% 


94.7% 


93.7% 


ECO100456 


SPA102914 


88% 


95.2% 


100% 


ECO100456 


STY100812 


89% 


99.8% 


99.8% 


ECO100456 


STM100504


89% 


99.7% 


99.7% 


ECO100456


UUR100392 


24% 


13.8% 


11.3% 


ECO100456 


YPS001764 


59% 


97.4% 


96.3% 


ECO100457 


EBC106191 


73% 


84.9% 


73.8% 


ECO100457 


ECO100457 


100% 


100% 


100% 


ECO100457 


KPN303890 


75% 


84.9% 


78.9% 


ECO100457


PRT105806


58% 


94.3% 


94.1% 


ECO100457 


SPA102916 


80% 


92.5% 


92.7% 


ECO100457


STY100814 


80% 


92.5% 


92.7% 


ECO100457


VCH101828 


58% 


81.1% 


61.4% 


ECO100457 


YPS003930 


57% 


84.9% 


88.2% 


ECO100458 


CDP100984 


27% 


56% 


24.8% 


ECO100458


EBC103989


56% 


100% 


100% 


ECO100458 


ECO100458


100% 


100% 


100% 


ECO100458 


KPN300488 


55% 


92% 


100% 


ECO100458 


KPN303892 


59% 


100% 


100% 


ECO100458 


PMU100990 


20% 


97.1% 


95.1% 


ECO100458 


PRT104644 


35% 


98.3% 


97.2% 


ECO100458 


PAE108407 


28% 


88.6% 


23.8% 


ECO100458 


SPA102917 


72% 


97.7% 


100% 


ECO100458 


VCH101827


32% 


81.1% 


79.1% 


ECO100458 


YPS001761 


36% 


98.3% 


97.2% 


ECO100464


ABA105152 


52% 


99.7% 


99.4% 


ECO100464


BFR105086 


32% 


90.2% 


82.4% 


ECO100464 


BPT101320 


59% 


99.2% 


98.1% 


ECO100464 


BBU100559 


41% 


98.4% 


93.4% 


ECO100464 


BCE101760 


60% 


99.4% 


99.1% 


ECO100464


BFU107645 


61% 


99.5% 


98.9% 


ECO100464 


BMA108677 


61% 


98.7% 


99.2% 


ECO100464 


CJU100481 


43% 


98.4% 


98.8% 


ECO100464 


CAC101040 


39% 


98.4% 


99.0% 


ECO100464 


CBO103036 


41% 


42.6% 


99.6% 


ECO100464


CDF102736 


29% 


96.0% 


95.7% 


ECO100464 


EBC103981 


91% 


100% 


100% 


ECO100464


ECO100464 


100% 


100% 


100% 


ECO100464 


HIN100103 


75%


99.5% 


98.7% 


ECO100464 


HPY100206 


45% 


98.6% 


98.4% 


ECO100464 


KPN303864 


91% 


100% 


100% 


ECO100464 


LPN102583 


62% 


99.5% 


99.2%


ECO100464


LMO102556 


23% 


98.6% 


98.2%


ECO100464 


MCA100562 


50% 


98.6% 


98.6% 


ECO100464


MAV101501 


48% 


97.1% 


96.7% 


ECO100464


MBV105727 


45% 


98.4% 


98.0% 


ECO100464


MLP100990 


45% 


98.4% 


98.0% 


ECO100464 


MTU202265 


45% 


98.4% 


98.0% 


ECO100464


NGO100474 


25% 


19.9% 


34.3% 


ECO100464 


NME200399 


25% 


19.9% 


18.6% 


ECO100464 


PMU101024


76% 


99.2% 


98.4% 


ECO100464 


PRT105522 


81% 


99.7% 


99.0% 


ECO100464 


PAE201595 


61% 


99.4% 


98.4% 


ECO100464 


PPU101233 


60% 


99.4% 


98.7% 


ECO100464 


PSY105027 


60% 


99.4% 


98.7% 


ECO100464 


SPA100495 


91% 


91.7% 


99.7% 


ECO100464 


STY100841 


94% 


100% 


98.7% 


ECO100464


STM100533 


94% 


100% 


98.7% 


ECO100464


TPA100974


41% 


98.4% 


98.7% 


ECO100464


VCH100968


69% 


99.2% 


98.1% 


ECO100464 


YPS001737 


86% 


100% 


99.7% 


ECO100465


ABA102578 


62% 


99.5% 


99.1% 


ECO100465 


BAN108855 


43% 


100% 


98.6% 


ECO100465 


BAN106781 


47% 


100% 


98.6% 






WO 02/077183 PCT/US02/09107 



ECO100465


BFR10586 


39% 


OR AO/ 
yo.0 /o 


/O 


ECO100465


BPT101280 


60% 


QO CO/ 


sfy . i /o 


ECO100465


BBU100416 


35% 


00 co/ 


OR 1°/ 


ECO100465


BCE114693


65% 


OA 10/ 


OA 4% 
f u.n /o 


ECO100465 


BFU106856 


65% 


OA W~ 


OA 4% 


ECO100465


BMA102106


65% 


OA IV* 


06 R% 

7U.O /O 


ECO100465


CJU100600 


25% 


OR IVa 


04 10/ 

y*T.O /0 


ECO100465


CPN200508 


34% 


Rl A0/» 


R1 A©/ 
o j.o/o 


ECO100465 


CTR200398 


32% 


04 00/. 


OO AO/ 
OJiU/0 


ECO100465


CAC102891


48% 


1V/V//0 


00 10/ 

yy. l /o 


ECO100465


CBO103787 


51% 


1 AAO/ 


OR AO/ 

yo.o /o 


ECO100465


CDF102350 


52% 


100%


OA 40/ 

yo.4 /o 


ECO100465 


CDP100003 


40% 


QA 40/ 
o 0.4/0 


OT "50/ 


ECO100465


EBC103984 


96% 


100%


100%


ECO100465


EFA201978 


45% 


00 «.% 


OR Ao/ 


ECO100465 


EFM200623 


45% 


00 


QO 10/ 

yy*i /o 


ECO100465 


ECO100465


100%


100%


100%


ECO100465 


HIN100331 


71% 


100%


100%


ECO100465


HPY100611


30% 


A1 0©/. 

0 1 .A /O 


TO 10/ 


ECO100465


KPN303865 


96% 


100%


100%


ECO100465 


LPN102084 


55% 


99.5%


99.1%


ECO100465


LMO102187 


46% 


99.5%


no cxt/ 
9o.O% 


ECO100465


MCA101231 


61% 


99.5%


no ro/ 
9o.O% 


ECO100465


MAV102071 


38% 


100% 


99.4%


ECO100465


MBV101628 


37% 


100%


99.4%


ECO100465 


MLP101114 


39% 


or ylO/ 


Q*7 10/ 
O / .J 70 


ECO100465


MTU200730 


37% 


100%


99.4%


ECO100465


MGE100174 


34% 


O/f ro/ 

o4.0% 


84.0%


ECO100465


MPN100646


32% 


QfL AO/ 


e/s CO/ 
50.5% 


ECO100465 


NGO100911 


62% 


100%


99.5%


ECO100465 


NME200954 


64% 


100%


99.5%


ECO100465 


PMU100284


71% 


on CO/ 


99.5%


ECO100465


PRT104973


81% 


100% 


100%


ECO100465 


PAE203684 


64% 


Or TO/ 

9o.7% 


96.7%


ECO100465


PPU107935 


64% 


Q< *70/ 

90.7% 


or 00/ 
90.o% 


ECO100465 


PSY104919 


60% 


Qt 10/ 


OA 00/ 

9o.z% 


ECO100465 


SPA100230 


92% 


100%


ni qo/ 
9 i .o% 


ECO100465 


STY100842 


96% 


100%


100%


ECO100465 


STM100555 


96% 


100%


100%


ECO100465 


SAU802229 


47% 


99.5%


98.6%


ECO100465 


SEP200243 


47% 


99.5%


98.6% 


ECO100465


SHA100179 


47% 


99.5% 


98.1% 


ECO100465 


SMU100597 


43% 


85.0% 


85.8% 


ECO100465 


SPN400210 


38% 


99.5% 


98.1% 


ECO100465


SPY200058 


40% 


99.5% 


98.1% 


ECO100465


TPA100588


38% 


98.6% 


97.2% 


ECO100465


UUR100253 


33% 


00 s% 

77J /O 


OR 1% 
yo t i /o 


ECO100465 


VCH100969 


73% 


99.5% 


99.5% 


ECO100465 


YPS001716 


87% 


100% 


100% 


ECO100468 


ABA104316 


27% 


50.7% 


63.8% 


ECO100468


EBC103990 


93% 


100% 


100% 


ECO100468


ECO100468


100% 


100% 


100% 


ECO100468


KPN303867 


89% 


100% 


100% 


ECO100468 


MCA101546 


24% 


57.4% 


72.1% 



ECO100468


PRT103854 


82% 


100% 


100% 


ECO100468


SPA107646


92% 


100% 


100% 


ECO100468 


STY100845


94% 


100% 


100% 


ECO100468 


STM100558 


94% 


100% 


100% 


ECO100468 


VCH103521 


69% 


100% 


100% 


ECO100468 


VCH101110 


75% 


100% 


100% 


ECO100468 


YPS001600 


83% 


100% 


100% 


ECO100469 


ABA105707 


48% 


97.3% 


97.0% 


ECO100469 


BAN105697 


24% 


61.3% 


85.4% 


ECO100469 


BAN103529 


27% 


63.6% 


89.8% 


ECO100469


BAN105257 


27% 


67.6% 


93.7% 


ECO100469


BFR10916 


30% 


63.8% 


48.5% 


ECO100469 


BFR11614 


32% 


69.9% 


50.9% 


ECO100469 


BBU100446


21% 


69.4% 


57.9% 


ECO100469 


BCE111018 


51% 


97.7% 


97.5% 


ECO100469 


BMA106790 


50% 


99.3% 


98.4% 


ECO100469 


CBO103826 


23% 


65.9% 


91.1% 


ECO100469 


CDP101177 


26% 


90.1% 


93.7% 


ECO100469 


EBC105058 


92% 


98.2% 


98.4% 


ECO100469 


EFM201852 


28% 


67.9% 


96.1% 


ECO100469 


ECO100469 


100% 


100% 


100% 


ECO100469 


KPN309208 


91% 


100% 


100% 


ECO100469 


MAV102680 


29% 


64.9% 


93.2% 


ECO100469 


MBV104698 


29% 


61.5% 


86.2% 


ECO100469 


MLP100477


27% 


61.5% 


86.5% 


ECO100469 


MTU203193 


29% 


61.5% 


86.2%


ECO100469 


PRT101586 


71% 


98.0% 


95.4% 


ECO100469 


PAE205513 


62% 


98.0% 


97.0% 


ECO100469


PPU103619 


33% 


68.3% 


64.0% 


ECO100469 


SPA103965 


83% 


98.2% 


100% 


ECO100469


STY100846


94% 


99.6% 


99.6% 


ECO100469 


SAU401722 


23% 


66.8% 


85.2% 


ECO100469 


TPA100056 


23% 


37.6% 


47.1% 


ECO100469 


YPS001597 


79% 


98.2% 


97.3% 


ECO100473 


BCE112818


22% 


90.5% 


79.4% 


ECO100473


BFU105062 


27% 


87.5% 


92.5% 


ECO100473


BMA109015 


23% 


55.7% 


93.1% 


ECO100473


CAC101805 


21% 


87.1% 


80.6% 


ECO100473 


EBC101032 


70% 


51.1% 


100% 


ECO100473 


ECO100473 


100% 


100% 


100% 


ECO100473 


KPN305997 


67% 


96.2% 


99.2% 


ECO100473 


PRT105875 


47% 


99.2% 


97.8% 


ECO100473 


SPA101218 


64% 


100% 


100% 


ECO100473 


STY100870 


65% 


100% 


100% 


ECO100473 


VCH102186 


33% 


88.6% 


80.2% 


ECO100473 


YPS001582 


56% 


99.2% 


97.4% 


ECO100475 


ABA101879 


41% 


7.2% 


16.0% 


ECO100475 


BAN105978 


44% 


57.4% 


98.6% 


ECO100475 


BAN110321


44% 


69.3% 


100% 


ECO100475


BFR10291 


39% 


86.6% 


98.1% 


ECO100475


BPT102136 


41% 


6.8% 


15.8% 


ECO100475


BCE110602


43% 


73.5% 


97.9% 


ECO100475


BFU113139


47% 


72.8% 


81.7% 


ECO100475


BMA106854 


46% 


73.3% 


73.4% 



ECO100475


CJU101080 


30% 


86.9% 


89.4% 


ECO100475 


CAC100879 


39% 


18.9% 


7.9% 


ECO100475 


CBO102754


38% 


99.2% 


99.5% 


ECO100475 


CDF102644


39% 


4.6% 


16.2% 


ECO100475 


CDP100778


38% 


5.2% 


7.9% 


ECO100475 


EBC100331 


89% 


4.2% 


19.6% 


ECO100475 


EFA201855 


39% 


19.2% 


7.9% 


ECO100475 


EFM201092 


37% 


87.5% 


98.5% 


ECO100475 


ECO100475


100% 


100% 


100% 


ECO100475 


HIN100276 


37% 


3.2% 


10.7% 


ECO100475 


HPY101055 


32% 


88.0% 


99.6% 


ECO100475 


KPN308679 


87% 


6.6% 


18.6% 


ECO100475 


LPN101141 


44% 


77.7% 


89.0% 


ECO100475 


LPN101253 


45% 


77.7% 


88.7% 


ECO100475 


LMO100878 


37% 


1.4% 


12.6% 


ECO100475 


MCA100406 


36% 


87.5% 


95.5% 


ECO100475 


MAV100988


36% 


72.9% 


81.5% 


ECO100475 


MBV103852 


44% 


77.7% 


79.7% 


ECO100475 


MLP101222 


34% 


87.4% 


96.5% 


ECO100475 


MTU200960 


44% 


77.7% 


81.9% 


ECO100475 


NGO100160 


39% 


86.7% 


98.1% 


ECO100475


NME201398 


40% 


87.2% 


98.6% 


ECO100475 


PMU101892 


39% 


2.9% 


10.9% 


ECO100475 


PRT105140 


72% 


27.5% 


12.7% 


ECO100475 


PAE203917 


41% 


7.1% 


16.3%


ECO100475 


PPU108258 


41% 


99.6% 


98.6% 


ECO100475


PSY102170 


42% 


7.0% 


21.1% 


ECO100475 


SPA101217 


84% 


86.8% 


100% 


ECO100475 


STY100871 


93% 


99.9% 


99.9% 


ECO100475 


STM104228


42% 


4.4% 


8.3% 


ECO100475


SAU802557 


38% 


99.9% 


99.5% 


ECO100475 


SEP201193 


37% 


7.2% 


16.6% 


ECO100475 


SHA100292 


40% 


73.9% 


100% 


ECO100475 


SMU100795 


41% 


4.4% 


8.9% 


ECO100475


SPN400641 


43%


72.1% 


79.3% 


ECO100475 


SPY201317 


41% 


4.6% 


8.9% 


ECO100475


TPA101026


34%


3.7% 


9.8% 


ECO100475


UUR100203 


28% 


76.9% 


92.9% 


ECO100475 


VCH102181 


50% 


9.0% 


19.3% 


ECO100475 


YPS001578 


66% 


7.3% 


22.9% 


ECO100485 


ABA102979 


44% 


78.8% 


90.3% 


ECO100485


BPT100402 


42% 


86.5% 


82.4% 


ECO100485 


BCE103660 


39% 


98.1% 


60.3%


ECO100485 


BFU101347 


38% 


93.3% 


90.2% 


ECO100485 


BMA107037 


42% 


86.1% 


81.2% 


ECO100485 


CAC101080 


25% 


84.1% 


98.9% 


ECO100485


CBO103052


27% 


48.6% 


52.1% 


ECO100485 


CDF104317


23% 


79.3% 


92.3% 


ECO100485 


EBC103361 


92% 


90.9% 


100% 


ECO100485


EFA202370 


25% 


74.0% 


85.1% 


ECO100485 


ECO100485 


100% 


100% 


100% 


ECO100485


KPN303861 


87% 


100% 


95.4% 


ECO100485 


LMO100084 


26% 


80.3% 


96.4% 


ECO100485 


NGO100490 


35% 


75.5% 


71.8% 


ECO100485 


NME201781 


35% 


75.5% 


71.8% 


ECO100485


PRT104443 


57% 


89.4% 


97.9% 


ECO100485 


PAE202854 


49% 


91.3% 


95.5% 


ECO100485 


PPU111360


45% 


93.8% 


91.2% 


ECO100485 


PSY100343 


47% 


75% 


98.8% 


ECO100485 


SPA101077 


87% 


98.1% 


94.9% 


ECO100485 


STY100879 


89% 


98.1% 


100% 


ECO100485 


SPY201146 


27% 


84.1% 


55.6% 


ECO100485 


VCH103503 


56% 


90.9% 


88.8% 


ECO100485 


YPS001527 


71% 


98.1% 


98.1% 


ECO100488


ABA100225 


26% 


47.2% 


31.6% 


ECO100488 


ABA100477 


26% 


27.1% 


80.5% 


ECO100488 


BAN104847


21% 


28.1% 


4.3% 


ECO100488 


BAN109223 


31% 


7.2% 


48.7% 


ECO100488 


BAN101440 


31% 


7.6% 


43.5% 


ECO100488 


BAN110288


23% 


20.8% 


3.8% 


ECO100488


BFR104422


59% 


1.9% 


27% 


ECO100488 


BCE108121 


26% 


28.9% 


68.0% 


ECO100488 


BCE104951 


26% 


49.5% 


30.5% 


ECO100488 


BFU107935 


29% 


21.0% 


31.4% 


ECO100488 


BFU100092 


51% 


6.2% 


98.9% 


ECO100488 


BFU100109 


28% 


9.3% 


60.6% 


ECO100488 


BFU102581 


29% 


11.2% 


4.4% 


ECO100488 


BMA107682 


26% 


25.9% 


53.9% 


ECO100488 


CAC100404 


28% 


7.9% 


39.1% 


ECO100488


EBC107494 


33% 


21.5% 


50.7% 


ECO100488


EBC103412 


28% 


21.5% 


25.6% 


ECO100488 


EBC104888 


26% 


28.5% 


68.9% 


ECO100488 


ECO100686 


77% 


23.5% 


70.2% 


ECO100488 


ECO101427 


86% 


41.4% 


86.8% 


ECO100488 


ECO100683 


77% 


88.2%


89.3% 


ECO100488 


ECO103515 


77% 


88.2% 


90.6% 


ECO100488 


ECO103405 


74% 


99.6% 


99.7% 


ECO100488 


ECO100488 


100% 


100% 


100% 


ECO100488 


MAV107400 


23% 


19.1% 


20.6% 


ECO100488


PRT103688 


29% 


7.3% 


73.6% 


ECO100488 


PRT103361 


31% 


7.6% 


75.1% 


ECO100488


PRT103421 


25% 


11.2% 


4.4% 


ECO100488 


PAE202682 


31% 


88.3% 


88.8% 


ECO100488 


PPU107712 


30% 


7.6% 


94.3% 


ECO100488


PPU109654 


27% 


6.8% 


73.5%


ECO100488


PPU110484


28% 


12.0% 


92.3% 


ECO100488 


PPU107101 


28% 


10.9% 


38.6% 


ECO100488


PPU109653 


28% 


41.0% 


73.7% 


ECO100488 


PPU109652 


25% 


32.9% 


38.8% 


ECO100488


PPU110482


29% 


53.6% 


54.8% 


ECO100488 


PSY108533 


33% 


4.9% 


83.6% 


ECO100488 


PSY101322 


55% 


4.1% 


24.3% 


ECO100488


PSY105816 


47% 


7.2% 


45.3% 


ECO100488 


PSY102083 


25% 


44.4% 


45.2%


ECO100488 


SPA106438 


38% 


2.5% 


16.8% 


ECO100488


SPA100247 


26% 


53.2% 


67.7%


ECO100488 


STY104094 


26% 


30.2% 


50.1% 


ECO100488 


STY104095 


30% 


59.5% 


51.1% 


ECO100488 


STM103807


42% 


8.0% 


50.8% 


ECO100488 


STM103806 


26% 


40.7% 


49.6% 


ECO100488 


YPS002860 


25% 


84.9% 


75.6% 


ECO100490


ABA100225 


43% 


36.9% 


5.8% 


ECO100490


ABA100477 


45% 


38.6% 


10.5% 


ECO100490 


BAN101440 


31% 


39.4% 


37.8% 


ECO100490


BAN109223 


31% 


39.4% 


44.2% 


ECO100490 


BAN110288


33% 


39.4% 


4.4% 


ECO100490 


BCE104951


36% 


50.4% 


8.6% 


ECO100490


BCE108121 


46% 


39.0% 


10.1% 


ECO100490 


BFU100092 


48% 


36.9% 


97.8% 


ECO100490 


BFU102581 


51% 


39.0% 


6.1% 


ECO100490 


BFU100109 


47% 


41.1% 


14.2% 


ECO100490 


BMA107682 


35% 


38.6% 


8.0% 


ECO100490 


CAC100404 


32% 


35.6% 


30.3% 


ECO100490 


EBC103412 


43% 


38.1% 


33.2% 


ECO100490 


ECO100686 


73% 


39.8% 


19.7% 


ECO100490 


ECO103517


66% 


45.8% 


45.5% 


ECO100490


ECO103515 


75% 


39.4% 


6.8% 


ECO100490 


ECO103405 


76% 


39.8% 


6.7% 


ECO100490 


ECO100683 


76% 


39.4% 


6.7% 


ECO100490 


ECO100488 


80% 


39.4% 


6.5% 


ECO100490 


ECO101427 


42% 


92.4% 


31.4% 


ECO100490 


ECO100490 


100% 


100% 


100% 


ECO100490 


PRT103361


33% 


38.6% 


65.5% 


ECO100490 


PRT103421 


45% 


39.4% 


7.7% 


ECO100490 


PAE202682 


51% 


39.4% 


7.0% 


ECO100490 


PPU107712 


49% 


23.3% 


63.2% 


ECO100490


PPU107101 


48% 


38.6% 


22.0% 


ECO100490 


PPU110482


46% 


39.0% 


6.6% 


ECO100490


PPU109652


45% 


40.7% 


6.4% 


ECO100490


PSY101322 


55% 


24.6% 


23.9% 


ECO100490


PSY105816 


46% 


39.0% 


40.8% 


ECO100490 


PSY102083 


47% 


39.0% 


5.8% 


ECO100490 


SPA106438 


46% 


37.3% 


40% 


ECO100490 


STY104095 


48% 


38.6% 
00 5%


yy.y /o 


ECO100632


CTR200474


40% 


00 cqz 

77.J /0 


OO 0% 

yy.yyo 


ECO100632


CAP 101 070 


40% 


00 0% 

77,V/70 


OO 1% 

yy. i /o 


ECO100632


CBO100061


40%


08 8% 


00 fi% 

yy.uvo 


ECO100632


CDF103555


44% 


00 4% 


OO 

yy.^/o 


ECO100632


CDP101454 


36%


00 0% 
yy.yj /o 


07 1% 

y / . i /o 


ECO100632 


EBP1O0530 


02% 


55 9% 


IfiAO/ 
lUvTo 


ECO100632


EFA200538 


42% 


00 3% 
yy.3 /o 


OO 5% 

yy.j/o 


ECO100632


EFM200610


43% 


00 3©/. 


oo <o/ 
yy.jyo 


ECO100632


ECO100632


100%


100%


100%


ECO100632


HTM100000 

AXAXN 1 vVyVU 


79% 


OQ O0A 

yy.yyo 


oo fioy 
yy.oyo 


ECO100632


HPV101 594 
axX X IvlJ^n 


AW* 


oc eo/ 
yo.oyo 


yy.3% 


FCOl 00639 


V'PXT'I AA<R 1 
lu IN jw^O 1 


y / /o 


lo.4% 


96.3% 


ECO100639 


ifp>j^n96j^t 

aVtxN j 00*t 


7J/0 


OO 00/ 

yy.oyo 


1 AAO/ 

100% 


ECO100632


T PM 103397 




iuuyo 


1 AAO/ 

100% 


ECO100632


LMO1006U 


42% 


99.3% 


99.5% 


ECO100632


MCA102972 


49% 


99.7% 


99.0% 


ECO100632 


MAV103767 


37% 


96.3% 


90.9% 


ECO100632 


MBV104696 


37% 


96.2% 


92.8% 


ECO100632


MLP100028 


36% 


99.9% 


97.7% 


ECO100632


MTU200041 


37% 


96.2% 


93.0% 


ECO100632


MGE100272


34%


99.4% 


99.6% 


ECO100632


K/fPXJ 100453 


34% 


99.4% 


99.5% 


ECO100632


unm ooo5Q 


57% 


99.9% 


99.8% 


ECO100632


>JTVvrP70fi*I1 8 
INMli2UvD 1 o 


55% 


99.9% 


100% 


ECO100632


pvrr T10171 A 


72% 


99.8% 


99.8% 




PPT1 07RR7 


80% 


100%


100% 


ECO100632


PAP703QR4 


57% 


99.8% 


99.8% 


ECO100632


PPTT10RR17 


57% 


99.8% 


99.8% 




P^V 105077 


57% 


99.8% 


99.8% 


DwU i UUO 


qpa 100430 


93% 


58.6% 


98.8% 




<n*vioi466 


95% 


100% 


100% 




9AHR01760 


42% 


99.3% 


99.5% 




C|7P70717t 


42% 


99.3% 


99.5% 




ctja 100631 


42% 


99.3% 


99.5% 




oivvrf Tl 004Q7 


40% 


99.4% 


99.5% 




^PT\J400735 


42% 


99.4% 


99.5% 


PPH1 00637 


SPY200128 


42% 


99.4% 


99.5% 




TP A 100579 


39% 


98.8% 


99.2%


PP0 100637 


TJTJR100373 


34% 


99.4% 


99.9% 


ppm 00637 


VCH1 00940 


75% 


99.8% 


95.7% 


ppoioo637 I 


YPS001199 


84% 


100% 


100% 




ABA101420


41% 


99.0% 


100% 


ECO100645


BPT102527


46% 


98.3% 


96.7% 


pprnoofi4< 


RPF1041 1 1 


65% 


100% 


99% 




UPT110Q6Q7 


45% 


99.0% 


85.4% 


rr«Ai AAAJC 


BMA107165


72% 


86.8% 


100% 


pprvi 00 
£vvL/AvAJ0*O 


PTCP107Q33 


91%


100% 


100% 


ECO100645


ECO100645


100% 


100% 


100% 




KPN300407 


91% 


41.7% 


100% 


ECO100645


KPN302708


89% 


100% 


100% 




PPT1 07877 


75% 


100% 


100% 


ECO100645


PAE201341 


58% 


97.4% 


97.7% 


PPH1 (Y\£A^ 


PPU101496


57% 


96.4% 


97.4% 


ECO100645


PSY105194


55% 


96.4% 


94.8% 


ECO100645


SPA102587


95% 


92.1% 


100% 


ECO100645


STY101796


93% 


100% 


98.1% 


ECO100645


YPS001709


78% 


100% 


100% 


ECO100647


ABA104991


36% 


96.5% 


92.3% 


ECO100647


BPT100972


30% 


94.1% 


98.1% 


ECO100647


j 100236 


28% 


32.4% 


32.1% 


ECO100647


nrp 102700 


33% 


92.2% 


82.3% 


ECO100647


RFTI1034R9 
or \j i vj*to7 


32% 


95.9% 


92.9% 


cpn i (V\AA.H 
C,\^\J IUUW / 


IjiYLrVlVJ /JJO 


31% 


70.3%


97.8% 


E.V^v/lUU0*r/ 


pn iimn7i 


73% 


53.7%


55.1%


■cr7"\1AfWiA7 


PPM7nOA07 


73% 


05 3% 
yj.o so 


92 1% 

1 /o 


i AAiC/i"? 
bCwlUUOH/ 


PTP7nrtfti n 


x;o /o 


35.7%


34.9%


EAsKJ 1 WOH / 


PAP1A1607 


74% 


79.7%


62.6%


CLaJIUvO**/ 


PTYPIfil 117 


74% 


Rl 6% 


90.6%


ECO100647 


EBC102934


85% 


99.6% 


100% 


ECO100647


ECO100647 


100% 


100% 


100% 


ECO100647 


HIN100288 


44% 


97.1% 


95.4% 


ECO100647 


HPY100177 


23% 


60.5% 


66.8% 


ECO100647 


KPN302711 


83% 


100% 


100% 


ECO100647


LPN102781 


30% 


84.6% 


98.6% 


ECO100647 


MCA100961 


36% 


96.3% 


94.9% 
ECO100647


MAV106114 


25% 


97.3% 


82.0% 


ECO100647


MBV102260


26% 


97.1% 


94.9% 


ECO100647


MLP100891


24% 


96.9% 


79.2% 


ECO100647


MTU202018 


26% 


97.1%


57.9%


ECO100647


NGO101835 


31% 


97.1% 


96.5% 


ECO100647 


NME200849 


31% 


97.1% 


96.5% 


ECO100647 


PMU101032 


45% 


96.3% 


94.0% 


ECO100647 


PRT102866 


61% 


96.5% 


97.2% 


ECO100647 


PAE203981 


40% 


95.7% 


95.1% 


ECO100647 


PPU107601 


41% 


94.3% 


94.7% 


ECO100647 


PSY105070 


41% 


95.1% 


95.7% 


ECO100647 


SPA102458 


85% 


76.2% 


100% 


ECO100647 


STY101797 


87% 


100% 


100% 


ECO100647 


TPA100249 


25% 


69.9% 


72.0% 


ECO100647 


TPA100413 


25% 


75.6% 


76.6% 


ECO100647 


VCH100942 


48% 


99.0% 


99.2% 


ECO100647 


YPS0012U 


65% 


99.6% 


99.2% 


ECO100648 


ABA106059 


43% 


83.6% 


89.6% 


ECO100648 


BFR10598 


34% 


86.6% 


57.9% 


ECO100648 


BPT100973 


42% 


91.8% 


89.8% 


ECO100648 


BBU100059 


37% 


79.8% 


90.7% 


ECO100648


BCE102638 


45% 


90.4% 


88.1% 


ECO100648


BFU103492 


45% 


88.4% 


82.6% 


ECO100648 


BMA100300 


44% 


95.5% 


89.3% 


ECO100648 


CPN200241 


32% 


70.5% 


53.5% 


ECO100648


CTR200696 


30% 


77.4% 


60.7% 


ECO100648 


CDP100549 


33% 


91.1% 


64.0% 


ECO100648 


EBC102935 


94% 


100% 


100% 


ECO100648 


EFM200835 


33% 


85.6% 


56.1% 


ECO100648 


ECO100648 


100% 


100% 


100% 


ECO100648 


HIN100287


63% 


97.3% 


95.7% 


ECO100648 


KPN302676 


96% 


100% 


92.7% 


ECO100648 


LPN103355 


47% 


86.3% 


88.4% 


ECO100648 


LMO101012 


30% 


89.0% 


61.8% 


ECO100648


MCA100807 


40% 


93.8% 


94.8% 


ECO100648 


MAV100201 


36% 


76.0% 


51.8% 


ECO100648 


MBV102614 


37% 


76.0% 


52.0% 


ECO100648 


MTU202330 


37% 


76.0% 


52.0% 


ECO100648


MGE100148 


26% 


66.8% 


45.5% 


ECO100648 


MPN100672 


25% 


66.8% 


45.5%


ECO100648


NGO101904


51% 


90.8% 


96.4% 


ECO100648


NME200659 


51% 


90.8% 


96.4% 


ECO100648


PMU101033 


64% 


95.9% 


93% 


ECO100648


PRT102899 


84% 


99.3% 


98.6% 


ECO100648 


PAE203980 


51% 


91.4% 


96.1% 


ECO100648 


PPU112260


52% 


91.4% 


96.1% 


ECO100648 


PSY105068 


52% 


91.4% 


95.7% 


ECO100648 


SPA102459 


96% 


100% 


100% 


ECO100648 


STY101798 


97% 


100% 


100% 


ECO100648 


SPN401776 


34% 


76.0% 


49.7% 


ECO100648 


SPY200275 


34% 


79.1% 


52.0% 


ECO100648 


TPA100641 


35% 


78.8% 


86.0% 


ECO100648


VCH100943 


66% 


96.9% 


96.6% 


ECO100648 


YPS001216 


86% 


100% 


100% 
BFR10Q448


41% 


10.0% 


16.5% 


ECO100652


BPT101695 


33% 


97.7% 


98.5% 


ECO100652


EBC100475


77% 


91.8% 


100% 


ECO100652


ECO100652


100% 


100% 


100% 


ECO100652


KPN302701 


78% 


100% 


100% 


ECO100652


MBV101815 


24% 


79.8% 


76.2%


ECO100652


MTU201729 


24% 


80.6% 


68.9% 


ECO100652


PMU101002


50% 


98.5% 


98.5% 


ECO100652


PRT102895


60% 


98.5% 


97.7% 


ECO100652


SPA102463


83% 


100% 


100% 




STY101812 


85% 


100% 


100% 


ECO100652


STM101551 


85% 


100% 


100% 


ECO100652


VCH100947 


52% 


99.7% 


96.7% 


ECO100652


YPS001221


63% 


99.5% 


99.2%


ECO100659


BFR10535 


28% 


87.9% 


90.3%


ECO100659


BMA106268


32% 


16.5% 


22.1% 


ECO100659


CBO102186 


30% 


74.6% 


96.5% 


ECO100659


CDP101325 


27% 


70.4% 


90.2%


ECO100659


EBC101470


91% 


100% 


100% 


ECO100659


EFM101256


25% 


92.9% 


95.3% 


ECO100659


ECO100659 


100% 


100% 


100% 


ECO100659


KPN301178


88% 


100% 


100% 


ECO100659


LMO100695


30% 


82.3% 


82.2% 


ECO100659


MAV103808


23% 


79.8% 


62.5% 


ECO100659


MBV102332 


22% 


79.8% 


78.8% 


ECO100659


MTU408202 


22% 


79.8% 


78.8% 


ECO100659


PMU100683 


54% 


97.8% 


98.3% 


ECO100659


PRT100305 


70% 


98.5% 


98.3% 


ECO100659


PSY101979 


30% 


18.0% 


33.3% 


ECO100659


PSY105778


23% 


49.8% 


60.1% 


ECO100659


SPA101689


93% 


100% 


100% 


ECO100659


STY101819 


94% 


100% 


100% 


ECO100659


SMU100070 


28% 


74.1% 


96.3% 


ECO100659 


SPN201158 


27% 


80.0% 


83.9% 


ECO100659


SPY201178 


29% 


74.9% 


97.2%


ECO100659


VCH100976


60% 


98.3% 


99.0% 


ECO100659 


YPS001229 


78% 


99.8% 


99.0% 


ECO100661


BAN112647


36% 


79.7% 


93.1% 


ECO100661 


BAN106376


34% 


90.6% 


87.0% 


ECO100661


BFR104796 


64% 


98.9% 


97.4% 


ECO100661


BBU100151 


60% 


98.5% 


97.8% 


ECO100661


BFU102111


22% 


44.4% 


19.7% 


ECO100661


CAC101312 


36% 


92.9% 


100% 


ECO100661


CBO103757 


38% 


94.0% 


98.0% 


ECO100661


CDF100159 


38% 


40.6% 


89.9% 


ECO100661


CDF100928


38% 


94.0% 


98.0% 


ECO100661


CDP101713 


44% 


79.7% 


81.9% 


ECO100661 


EBC101472 


93% 


100% 


100% 


ECO100661


EFA201130 


34% 


92.9% 


100% 


ECO100661 


EFM201449 


33% 


92.9% 


100% 


ECO100661 


ECO100661 


100% 


100% 


100% 


ECO100661 


HIN100134 


74% 


97.4% 


95.9% 


ECO100661 


KPN301180 


93% 


100% 


100% 


ECO100661 


LMO102723


39% 


76.7% 


83.8% 
ECO100661


MLP100363


25% 


59.8% 


64.0% 


ECO100661 


MTU401062 


23% 


76.3%


80.6% 


ECO100661


MTU406545 


27% 


58.6% 


83.3% 


ECO100661 


PMU100875


76% 


100% 


99.6% 


ECO100661 


PRT101157 


75% 


94.0% 


97.3% 


ECO100661


SPA101691 


95% 


67.7% 


100% 


ECO100661 


STY101831 


95% 


100% 


100% 


ECO100661


SAU800569 


35% 


77.1% 


81.3% 


ECO100661


SEP200118 


31% 


90.6% 


97.1% 


ECO100661


SHA102516 


37% 


82.0% 


90.5% 


ECO100661


SMU100790 


38% 


77.1% 


84.5% 


ECO100661 


SPN401271 


32% 


90.2%


94.2% 


ECO100661 


SPY201070 


35% 


79.3% 


86.8% 


ECO100661 


VCH103743 


79% 


99.6% 


99.6% 


ECO100661


YPS001237 


85% 


100% 


100% 


ECO100662


BAN102604 


46% 


71.6% 


100% 


ECO100662 


BAN102323 


52% 


71.6% 


100% 


ECO100662 


BCE100659 


55% 


65.6% 


98.9% 


ECO100662


BFU102702 


55% 


74.1% 


81.8% 


ECO100662


BMA108578 


54% 


73.1% 


98.5% 


ECO100662


CTR200280 


35% 


10.6% 


18.4% 


ECO100662 


CAC100601 


48% 


71.9% 


98.8% 


ECO100662


CBO100356


42% 


18.8% 


63.9% 


ECO100662


CBO102223 


38% 


99.2% 


99.1% 


ECO100662


EBC101577 


45% 


70.7% 


100% 


ECO100662


EFA201939 


40%


95.4%


95.6%


ECO100662


EFM200960 


40% 


96.6% 


96.8%


ECO100662


ECO100662 


100% 


100% 


100% 


ECO100662


KPN301181 


88% 


99.8% 


yy* i so 


ECO100662 


MGE100071 


32% 


2.9% 


70 0% 
/o 


ECO100662


MPN100624


32% 


2.9% 


71.1%


ECO100662


PMU100876


43% 


71.9% 


QQ £0Z 

yy >o /o 


ECO100662


PRT101158 


48% 


100% 


100% 


ECO100662 


PAE203758 


50% 


71.1% 


QR 4% 


ECO100662


SPA101338 


90% 


99.8% 


QQ 7% 


ECO100662


STY101832 


92% 


99.8% 


yy. i /o 


ECO100662


STM101571 


92% 


99.8% 


QQ 7% 
yy» 1 /o 


ECO100662


SAU800189 


39% 


95.1% 


7J.X/0 


ECO100662 


SEP201169 


39% 


96.0% 


Q£ AO/* 
7V.U /o 


ECO100662


SHA102201


38% 


96.0% 


OS QOA 

yo.y /o 


ECO100662


SMU100648 


30% 


95.8% 


Oft 


ECO100662


SPY201526


31% 


97.1% 


QA A<>/« 
yo.o /o 


ECO100662


VCH100978


47% 


71.8% 


07 70A 


ECO100662


YPS001239 


52% 


99.8% 


QQ 70/. 


ECO100663


ABA100614 


54% 


70.2% 


Qft 1% 
yo. / /o 


ECO100663


BFRU594 


52% 


99.3% 


Qfl ^% 


ECO100663 


BPT101559 


50% 


99.1% 


99.5% 


ECO100663 


BCE112007


56% 


98.0% 


97.7% 


ECO100663


BFU109691 


56% 


98.0% 


97.7% 


ECO100663


BMA101111 


57% 


98.2% 


93.5% 


ECO100663 


CDF100047 


54% 


98.4% 


98.7% 


ECO100663 


EBC101578 


94% 


100% 


100% 


ECO100663 


ECO100663 


100% 


100% 


100% 


ECO100663


HIN101320 


76% 


97.7% 


97.1% 
ECO100663


VPXJIAAAOO 


st* /0 


/ /O 


100% 


ECO100663


LrIN10z2S4 


JJ /o 


7 y% 
/.z/o 


20 1% 


ECO100663


T DXTI AAC1 H 
i-riN 1UU 3 1 / 


JO/0 


77.0/0 


99.8% 


ECO100663


Mw\lL/Uo31 


Jl/0 


Q0 1% 
77. x /o 


99.5% 


ECO100663


XTfJA 1 A 1 OQ1 
INVJVlUlZy 1 


DU /0 


Q7 *5% 


97.7% 


ECO100663


XT\yfT30A1 /CAO 
iNMbzUlOU/ 


ou /o 


07 ^% 
y / .j /o 


95.8% 


ECO100663


DkvfT T1 AACOQ 

rMU 1UU3ZB 


/ / /0 ! 


OR 0% 
yo.w /o 


97 1% 


ECO100663


nnTi ACAOO 
rKl 103OZ7 


oD/o 


1 AA0/ 


99 6% 

77.V /o 


ECO100663


rAE2017y3 


£A0/L 
OU/o 


AA 10/ 

yy. 1 /o 


yy. 1 /o 


ECO100663


DOT T1 AOA1 1 

rrUlUoyil 


AO0/ 1 


QR £0/ 

y 0.0/0 


Oft R% 

7U.O /0 
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